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SUMMARY 

This  report  summarizes  the  results  of  a  three  year  study  sponsored  by  the  Air  Force  Office  of  Spon¬ 
sored  Research  under  contract  No.  F49620-92-J-0496.  The  enthusiastic  technical  and  administra¬ 
tive  effort  of  Drs.  Spencer  Wu  and  Brian  Sanders  of  AFOSR  are  warmly  acknowledged. 

This  project  has  involved  analytical  and  experimental  research  across  a  family  of  structural  me¬ 
chanics  and  control  problems.  Our  effort  has  been  mainly  addressed  to  four  sets  of  research  issues: 

1 .  Solution  and  Validation  Methodology  for  Simulation  of  Nonlinear  Structural  Systems 
See  Attachments  [2,3,14]. 

2.  NonlinearMechanics  and  Control  of  Flexible  Structural  and  Robotic  Systems 

See  Attachments  [4-8,14-18]. 

3.  Representation  of  Finite  Rotations  in  3  and  N-Dimensions:  Applications  in  Mechanics 

See  Attachments  [9-11,13]. 

4.  Radial  Basis  Approximation  Methods  and  Associated  Optimization  Algorithms 

See  Attachments  [12]. 

In  addition  to  the  above  four  sets  of  research  issues,  we  have  also  engaged  in  significant  re¬ 
search  on  ancillary  topics  which  are  documented  in  the  references  listed  in  Attachment  1.  The 
above  research  spans  a  broad  set  of  theoretical/conceptual  [6,7,9-11,13-18],  computational  [2- 
4,12,14],  and  hardware  experimental  [8]  research  topics. 

In  the  text  of  this  report,  we  present  a  brief  guided  tour  of  the  results  as  a  preamble  to  the  nine¬ 
teen  attachments  which  present  the  details  of  the  research  methodology  and  results. 
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1.0  Introduction 

This  report  presents  results  achieved  during  a  three  year  research  project  at  Texas  A&M 
University  sponsored  by  AFOSR  under  contract  F49620-92-J-0496  POOOl.  The  work  was  carried 
out  by  the  Principal  Investigator  (J.  L.  Junkins)  and  a  team  of  mainly  Ph.D.  candidate  co-research¬ 
ers.  As  is  evident  from  a  brief  review  of  the  attachments,  a  substantial  volume  of  research  results 
have  emerged  from  this  work.  Given  the  volume  of  results,  we  decided  to  overview  only  the  main 
features  of  the  results  in  the  text,  and  make  the  technically  more  detailed  attachments  the  heart  of 
our  report. 

The  level  of  effort  required  to  produce  the  attached  results  represents  approximately  five 
man-years  of  total  effort.  Since  only  half  that  level  of  effort  was  funded  by  contract  F49620-92-J- 
0496,  it  is  evident  that  the  matching  State  of  Texas  support  (Advanced  Technology  Project  Num¬ 
bers  999903-231  and  999903-232)  has  resulted  in  an  augmentation  of  this  project  which  consider¬ 
ably  leveraged  the  AFOSR  support. 

This  report  documents  our  results  in  four  broad  categories; 

Solution  and  Validation  Methodology  for  Simulation  of  Nonlinear  Structural  Systems 
Nonlinear  Mechanics  and  Control  of  Flexible  Structural  and  Robotic  Systems 
Representation  of  Finite  Rotations  in  3  and  N-Dimensions:  Applications  in  Mechanics 
Radial  Basis  Approximation  Methods  and  Associated  Optimization  Algorithms 

Attachment  No.  1  lists  19  refereed  publications  that  have  been  the  result  of  this  work  during  1993- 
1996,  and  also  lists  the  graduate  students  that  have  been  supported  under  this  contract.  In  addition, 
two  additional  students  and  a  post-doctoral  researcher  have  been  supported  under  support  of  State 
of  Texas  support  (Advanced  Technology  Project  Numbers  999903-231  and  999903-232)  perform¬ 
ing  ancillary  research. 

The  discussion  below  overviews  selected  aspects  of  the  contribution  in  each  of  the  above 
categories;  the  details  are  covered  in  the  attachments. 

2.0  Selected  of  Technical  Results 

In  Attachment  [2,3],  we  present  some  very  significant  results  from  this  research  project;  we 
have  developed  methodology  for  validation  of  solution  accuracy  of  nonlinear  dynamical  response. 
This  methodology  applies  to  a  wide  class  of  physical  systems  modeled  as  systems  of  ordinary,  par¬ 
tial,  or  integro  differential  equations  and  associated  boundary  condition  operators.  It  permits  the 
anal)4ical  construction  of  exact  solutions  (along  with  rigorously  consistent,  small  perturbing  force 
functions),  which  neighbor  given  approximate  numerical  solutions.  We  show  that  is  is  possible  to 
construct  these  special  case  exact  solutions  in  spite  of  the  fact  that  the  original  initial  value  problem 
cannot  be  solved  exactly  in  closed  form.  The  research  reported  in  these  papers  consist  of  basic  an¬ 
alytical  results  and  a  careful  proof-of-concept  experiments  for  several  example  systems  described 
by  ordinary  and  partial  differential  equation  systems.  For  a  wide  class  of  nonlinear  dynamical  sys¬ 
tems  described  by  ordinary  differential  equations,  we  have  developed  an  algorithm  and  software 
that  represent  a  standardized  approach  which  promises  to  be  of  broad  utility.  For  the  class  of  dis¬ 
tributed  parameter  systems,  we  have  worked  several  examples  and  established  proof  of  concept, 
however,  we  have  not  found  it  feasible  to  construct  a  general  purpose  software  package  for  this 
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case.  Shown  below  in  Figure  1  is  a  slide  format  result  abstracted  from  Attachment  3;  we  depict  the 
error  surfaces  between  a  family  of  approximate  response  solutions  compared  to  an  exact  solution  we 
constructed  using  the  method  of  Attachments  [1,2]. 


Figure  1. 


^♦niphirfll  Mpfhflnics:  Nonlinear  Response  MethodologY 

^  Means  for  Snlntion  Validation  -  Jiinkins  (Texas  A&M) 

•  A  method  for  construction  ot  exact  benchmark  solutions  neighboring  available  aproxlmate  solutions. 

•  Provides  capability  to  determine  exact  special  case  space-time  solution  errors  of  numerical  methods. 

•  Permits  rigorous  validation  of  numerical  methods  =>  assess  accuracy  limitations  of  methodoK^y. 

•  Permits  optimal  tuning  of  a  given  numerical  method  (e.g.,  select  step  size,  ITEM  grid,  order,  etc.). 

•  Permits  rigorous  tradeoff  studies  for  evaluating  merits  of  competing  numerical  methods. 

Refs.:  Junldns,  J.  and  Lee,  S.,  "VaUdation  of  Rnite  Dimensional  ApproximaUon  Solutions  for  Dynamics  of  Distributed  Parameter  Systems," 

Adv,  in  the  Astron.  Sci£ttces,\dL  pp.  2089-2111  (1994)  .  •  ci.  ».  ^  Vni  i  Nn  S  on  403-414  n9941 

Lee,  S.,  and  Junldns,  J.,  "Constniction  of  Exact  Benchmark  Problems  for  Dynamical  Systems,  Shock  and  Vibration,\ol.  I,  No.  5,  pp.  403-414  (1994). 

Absolute  Error  Convereence  Study:  ||  Multi-Flexible  Body  Example  =>  Hybrid  ODE/PDE  Sptem 

NlnJLr  OsclIlato^Example  Norm  of  Exact  Solution  Errors  vs  Space-Time  Step  Sizes: 

,  e(x,t)  -  [approx  solution] 

y  mintiv  ffir/irt  SOlution] 


absolute  errors  vs  step  size 
I  (benchmark  problem) 


nonlinear  response  vs  time 


19*  i(r».  Jt* 

step  size  h  [sj 


absolute  errors  vs  step  size 

1(20%  perturbed  problem)y  ' 


ktW 


,0-12 


time  [s] 


exact  errors  vs  time 

- 7L~: _ I _ ^-1, 


In  Attachments  [4,5,15,18],  we  present  a  substantial  volume  of  new  material  on  stability  and 
control  of  multi-body  structural  systems  and,  in  particular,  explore  some  of  the  conceptual,  mathe¬ 
matical,  and  numerical  issues  in  underlying  cooperation  between  two  or  more  autonomously  con¬ 
trolled  manipulators  maneuvering  a  common  paylod  or  object.  For  the  typical  case  of  redundant 
actuation,  there  are  an  infinity  of  controls  to  affect  essentially  the  same  dyn^cal  inotion,  however, 
each  control  policy  and  resulting  control  forces  represent  different  constraint  loading  on  the  struc¬ 
ture.  A  familar  example  is  two  or  more  humans  manipulating  a  heavy  object  such  as  a  soffa  or  a  pool 
table;  it  is  apparent  that,  due  to  actuator  redundancy,  the  same  rigid  body  trajectory  can  be  acheived 
by  an  infinity  of  actuation  forces,  but  most  of  these  control  policies  result  in  the  actuators  ‘fighting 
each  other  and  imposing  unnecessary  constraint  loads  on  the  payload  (and  frastration  of  the  actua¬ 
tors).  By  defining  an  appropriate  optimization  policy,  it  is  possible  to  minimize  the  norm  of  the  con¬ 
straint  forces,  for  example,  and  thereby  cause  the  manipulators  to  cooperate  in  carrying  out  the 
maneuver.  In  Attachments  [4,5],  we  develop  a  conceptual  and  mathematical  basis  for  formulating 
cooperative  optimal  control  strategies  and  study  the  efficacy  and  robustness  of  this  approach  through 
several  simulation  studies.  Recently,  Agrawal  and  his  student  Gary  Yale  at  the  Naval  Postgraduate 
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School  have  successfuUy  implemented  this  idea  experimentally  in  collaboration  with  the  Pnncipal 
Investigator,  and  have  verified  that  the  approach  has  practical  validity  as  well  as  theoretical  ele¬ 
gance. 

In  attachments  [6],  we  extend  the  classical  linear  quadratic  regulator  (LQR)  to  admit  ine- 
Quality  constraints  on  the  control  variables.  This  modest  extension  of  the  LQR  is  very  significant, 
because  one  of  the  classical  shortcomings  of  the  LQR  is  that  there  was  no  apriori  guarantee  that 
the  opt  control  derived  was  in  fact  physically  realizable.  A  numerical  example  is  given  m  [6],  to 
illustrate  that  the  algorithm  obtained  is  indeed  numerically  feasible. 

In  Attachment  [7],  we  present  an  analytical  result;  we  introduce  a  novel  theoretical  path  for 
asymptotic  stabiUty  analysis  for  systems  wherein  the  chosen  Lyapunov  function  is  negative  semi- 
definite.  We  use  the  new  methodololgy  to  show  that  a  commonly  applied  output  feedback  control 
law  (for  controlling  a  symmetric  four  appendage  structure)  guarantees  asympotic  stability  of  ml  m- 
fmity  of  the  antisymmetric-  in-unison  modes,  however,  it  does  not  guarantee  the  stability  of  the 
infinity  of  antisymmetric-  in-  opposition  modes  which  are  both  unobservable  and  uncontrollable. 

Attachment  [8]  presents  analytical,  computational,  and  experiment^  results  for  near  mini¬ 
mum-fuel  and  near-minimum-time  control  of  the  ASTREX  structure  at  Phillips  Laboratory.  The 
results  in  [8]  establish  the  validity  and  effectiveness  of  our  overall  approach,  however  some  exper¬ 
imental  anomalies  were  revealed  due  to  several  constraints  imposed  by  the  present  sensor/actutor 
system  development. 

In  attachments  [9-1 1,13],  we  present  another  significant  result  of  our  research  that  we  ex¬ 
pect  to  have  important  consequences.  We  have  been  able  to  greatly  extend  and  generalize  a  fun¬ 
damental  classical  result  known  as  the  Cayley  Transform,  to  establish  a  revolutiona^  method  for 
parameterization  of  NxN  proper  orthogonal  matrices.  These  results  permit  one  to  view  the  evlo- 
lution  of  an  NxN  orthogonal  matrix  in  terms  of  a  minimal  [N(N-l)/2-dimensional]  set  of  onenta- 
tion  parameters’  that  are  closely  related  to  the  quaternions  or  Euler  Parameters  famous  for  the 
usual  3x3  orthogonal  direction  cosine  matrix  case.  Thus  the  evolution  of  an  NxN  orthogonal  ma¬ 
trix  can  be  qualitatively  conceptualized  as  the  motion  of  a  generalized  rigid  body  reference  frame. 
Since  the  spectral  decomposition  of  all  NxN  symmetric  positive  definite  matrices  (which  abound 
in  mechanics!)  is  a  similarity  transformation  involving  the  orthogonal  NxN  matrix  of  eigenvectors 
and  the  N  positive  scalar  eigenvalues,  it  is  apparent  that  nonsingular  minimal  parameter  descrip¬ 
tions  of  orthogonal  matrices  immediately  enables  minimal  paraineter  descriptions  of  a  general  pos¬ 
itive  definite  N*N  matrices.  Several  applications  are  considered  in  the  references  that  illustrate  the 
utUity  and  support  the  conclusion  that  these  results  are  fundamental  in  nature  and  will  have  a  broad 
impact. 

In  attachment  [12],  we  present  a  method  for  converting  a  general  functional  optimzation 
problem  into  a  nonlinear  progranuning  problem  by  prameterizing  the  unknown  control  using 
basis  functions  (RBFs).  An  adaptive  RBF  approximation  method  is  introduced  wherein  an  initially 
small  number  of  basis  functions  is  gradually  increased  with  the  center  locations  being  decided 
based  upon  the  sensitivity  of  the  trajectory  to  variations  of  the  weights  on  the  currently  existing  set 
of  RBFs.  The  method  adapts  both  the  center  locations  and  the  local  sharpness  of  the  RBFs,  and 
uses  the  converged  result  from  the  previous  iterations  to  initiate  the  subsequent  iteration  with  an 


accurate  starting  iterative  which  satisfies  the  terminal  boundary  contions.  The  convergence  and 
efficacy  of  the  method  is  studied  through  two  examples  (an  optimal  trajectory  problem  and  an  op¬ 
timal  aerodynamic  shape  problem)  fopr  which  the  optimal  solution  has  been  previously  deternuned 
in  the  literature.  The  method  is  also  compared  to  a  non-adaptive  RBF  approach  and  the  results 
clearly  establish  the  validity  and  attractiveness  of  this  approach. 

In  attachment  [14],  we  introduce  a  potentially  revolutionary  method  for  simulating  dynam¬ 
ics  of  nonUnear  multi-body  systems  wherein  a  configuration-variable  mass  matrix  occurs.  In  con¬ 
ventional  algorithms,  computing  acceleration  requires  inversion  of  this  configuration-viable 
mass  matrix  which  directly  limits  the  speed  and  precision,  and  ultimately,  the  practical  dimension¬ 
ality  of  multibody  simulations.  It  also  means  that  so-called  order  N  methods  are  not  really  order 
N  when  considering  the  dynamics  of  nonlinear  flexible  multibody  systems.  The  new  method  in- 
troduced  involves  a  unique  coordinate  transformation  to  a  new  coordinate  system  which  maps  the 
instantaneous  mass  matrix  into  an  identity  matrix.  This  is  not  done  by  solving  a  local  algebraic 
eigenvalue  problem  via  conventional  solvers,  but  rather  new  differential  equations  are  derived  that 
inherently  generate  the  instantaneous  diagonalizing  transformation.  The  validity  and  utility  of  the 
algorithm  is  proven  conclusively  in  [14],  including  a  low  dimensioned  application,  and  in  [19],  we 
apply  it  to  a  14th  order  dynamical  model  for  the  Freewing  Scorpion  UAV .  These  analytic^  and 
numerical  studies  prove  the  validity  and  show  that  this  formulation  has  broad  applicability  in  non¬ 
linear  multi-body  dynamics. 

3.0  Conclusions 

It  is  evident  that  the  research  progress  is  excellent  on  many  fronts.  We  have  achieved  sig¬ 
nificant  analytical  progress  and  in  several  important  instances  have  progressed  from  introduction 
of  a  basic  concept,  to  analytical  studies,  and  proof-of-concept  conputational  and  hardware  demon¬ 
strations,  within  this  three  year  effort.  Of  course,  this  progress  has  been  achieved  in  large  measure 
due  to  historical  investments  of  AFOSR  resources  in  support  of  our  effort  to  develop  the  analytical 
and  experimental  foundation  upon  which  this  progress  rests.  It  is  also  significant  that  the  ancillary 
financial  support  obtained  from  Texas  Advanced  Research  Project  gr^ts  has  greatly  accelerated 
our  work  and  thereby  leveraged  the  AFOSR  investment.  It  is  of  special  significance  to  note  that 
five  exceptional  graduate  students  and  a  postdoctoral  researcher  have  been  supported  during  this 
project  and  three  of  the  four  Ph.  D.  students  have  successully  defended  their  dissertations.  Thus, 
quite  apart  from  the  technical  fruits  of  this  research  project,  the  development  of  outstanding  young 
engineers  and  scientists  has  been  significant  indeed. 
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Construction  of  Benchmark 
Problems  for  Solution  of 
Ordinary  Differential 
Equations 


An  inverse  method  is  introduced  to  construct  benchmark  problems  for  the  numerical 
solution  of  initial  value  problems.  Benchmark  problems  constructed  in  this  fashion 
have  a  known  exact  solution,  even  though  analytical  solutions  are  generally  not 
obtainable.  The  process  leading  to  the  exact  solution  makes  use  of  an  initially  avail¬ 
able  approximate  numerical  solution.  A  smooth  interpolation  of  the  approximate 
solution  is  forced  to  exactly  satisfy  the  differential  equation  by  analytically  deriving  a 
small  forcing  function  to  absorb  all  of  the  errors  in  the  interpolated  approximate 
solution.  Using  this  special  case  exact  solution,  it  is  possible  to  directly  investigate  the 
relationship  between  global  errors  of  a  candidate  numerical  solution  process  and  the 
associated  tuning  parameters  for  a  given  code  and  a  given  problem.  Under  the  as¬ 
sumption  that  the  original  differential  equation  is  well-posed  with  respect  to  the  small 
perturbations,  we  thereby  obtain  valuable  information  about  the  optimal  choice  of  the 
tuning  parameters  and  the  achievable  accuracy  of  the  numerical  solution.  Five  illus¬ 
trative  examples  are  presented,  ©  1994  John  Wiley  &  Sons,  Inc. 


INTRODUCTION 

We  consider  the  initial  value  problem  for  linear 
or  nonlinear  ordinary  differential  equations.  In 
general,  we  do  not  know  the  true  solution  and 
any  numerical  method  gives  us  an  approximate 
solution;  the  numerical  solutions  generally  con¬ 
tain  two  sources  of  error,  round-off  and  trunca¬ 
tion  (Gear,  1971).  We  must  somehow  evaluate 
the  accuracy  of  a  given  approximate  solution, 
typically  without  knowing  the  true  solution.  The 
most  common  way  of  assessing  the  true  error  of 
a  numerical  solution  is  to  reduce  some  tolerance 
parameter,  integrate  again,  and  compare  the 
results  (Hairer  et  al.,  1987;  Shampine,  1987).  Al¬ 
though  more  sophisticated  error  analyses  can  be 
conducted,  there  is  no  general  way  to  absolutely 
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guarantee  the  final  accuracy  of  the  solutions. 
This  does  not  preclude  obtaining  practical  solu¬ 
tions  for  most  applications,  but  it  remains  very 
difficult  to  answer  subtle  questions. 

Many  numerical  methods  are  available  for 
solving  initial  value  problems.  Early  numerical 
methods  were  merely  fixed  step  size  implementa¬ 
tions  and  these  methods  were  straightforward  to 
implement,  but  the  results  were  often  inconclu¬ 
sive.  In  the  1960s,  research  on  numerical  meth¬ 
ods  for  highly  nonlinear  initial  value  problems  led 
to  adaptive  methods  that  could  automatically 
vary  the  step  size  and/or  the  order  of  the  method 
to  match  a  user-specified  local  error  tolerance  at 
each  step.  This  work  led  to  the  current  genera¬ 
tion  of  numerical  methods.  Due  the  presence  of 
round-off  error,  it  is  common  to  find  that  accu- 
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racy  improves  until  step  sizes  or  tolerances  are 
decreased  below  some  critical  value;  the  accu¬ 
racy  then  degrades  while  solution  costs  increase 
(Gear,  1971;  Shampine,  1974).  Shampine  (1974, 
1980)  pointed  out  that  a  typical  adaptive  code 
will  not  quit  when  impossible  accuracies  are 
specified.  He  also  reported  that  the  standard 
ways  to  assess  true  errors  may  lead  to  wrong 
conclusions  even  using  the  best  codes  available 
at  that  time.  Shampine  (1974)  considered  a  ma¬ 
chine  dependent  limit  on  the  step  size  and  one  on 
the  local  error  tolerance,  and  he  suggested  a  way 
of  automatically  selecting  an  initial  step  size  that 
appears  to  be  reliable  and  reasonably  efficient 
(Shampine,  1978).  Enright  (1989)  pointed  out  that 
the  relationship  between  the  accuracy  obtained 
and  the  specified  tolerances  is  generally  ex¬ 
tremely  sensitive  to  both  the  problem  and  the 
method.  In  particular,  for  Runge-Kutta  methods 
with  interpolants,  he  proposed  an  error  and  step 
size  control  mechanism  based  on  monitoring  and 
controlling  the  defect  of  a  continuous  approxima¬ 
tion  rather  than  the  local  error  of  the  discrete 
approximation. 

In  view  of  the  historical  and  recent  develop¬ 
ments,  we  observe  that  the  theory  of  differential 
equation  solvers  is  far  from  complete,  so  that  the 
understanding  of  a  given  code’s  performance  in¬ 
variably  requires  a  study  of  experimental  results. 
Hull,  et  al.  (1972)  and  Krogh  (1973)  provided  two 
outstanding  collections  of  test  problems  for  this 
purpose.  These  test  problems  have  been  used  in 
the  development  and  testing  of  many  codes  and 
can  be  regarded  as  standard  benchmark  prob¬ 
lems  for  initial  value  problem  solvers.  Whenever 
we  know  the  true  solutions  of  a  test  problem, 
however,  we  can  investigate  the  relationship  be¬ 
tween  the  true,  or  global  error  and  the  tuning 
parameters  of  a  given  code  (e.g.,  step  size,  local 
error  tolerance,  order,  etc.).  The  relationship  be¬ 
tween  the  behavior  of  an  algorithm  on  a 
benchmark  problem  and  the  behavior  of  the  algo¬ 
rithm  on  a  problem  of  interest  is  difficult  to  estab¬ 
lish.  Because  the  problem  of  interest  is  almost 
never  exactly  solvable,  we  need  a  means  to  es¬ 
tablish  a  customized  benchmark  problem  that  is  a 
close  neighbor  of  any  given  problem  of  interest. 
We  introduce  here  a  broadly  applicable  inverse 
method  that  constructs  a  neighbor  of  a  given  nu¬ 
merical  approximate  solution;  the  neighboring 
problem  does  in  fact  exactly  satisfy  the  original 
differential  equations  (with  a  known,  small 
forcing  function)  and  serves  as  an  excellent 
benchmark  problem.  More  specifically,  we  pre¬ 


sent  a  broadly  useful  approach  to  construct  a 
benchmark  problem  near  the  problem  of  interest 
in  a  particular  application.  By  virtue  of  the  fact 
that  the  benchmark  problem  is  a  customized  near 
neighbor  of  the  problem  of  interest,  we  show 
that  numerical  convergence  studies  on  the 
benchmark  problem  are  directly  useful  in  algo¬ 
rithm  selection,  tuning,  and  accuracy  validation. 

The  difficulties  mentioned  earlier  result  from 
not  knowing  the  true  solution.  What  happens  if 
we  are  able  to  construct  a  problem-dependent 
“exact”  benchmark  problem?  First  we  can  eas¬ 
ily  investigate  the  true  error/parameter  relation¬ 
ship  and  find  the  limiting  precision  and  associ¬ 
ated  values  of  critical  parameters  of  a  given 
code.  Second,  the  problem  of  how  to  assess 
global  error  vanishes  automatically.  Finally,  we 
have  an  absolute  standard  to  find  which  method 
is  most  suitable  for  an  important  member  of  our 
particular  family  of  problems.  The  sensitivity  of 
the  accuracy/tolerance  relation  of  a  given 
method  is  primarily  a  result  of  the  heuristics  used 
to  monitor  the  local  error  and  control  the  step 
size.  If  we  do  not  know  the  true  solution,  then  it 
is  very  hard  to  assess  which  method  is  the  best 
for  a  class  of  problems  because  of  the  high  sensi- 
tivity  of  accuracy  to  variations  in  step  size  con- 
trol  logic.  The  remaining  and  most  critical  ques¬ 
tion  is:  How  useful  is  the  convergence  and 
accuracy  information  obtained  for  the  exactly 
solved  benchmark  problem,  in  regard  to  drawing 
conclusions  for  the  (neighboring)  original  prob¬ 
lem?  It  is  important  to  recall  that  the  benchmark 
problem  includes  a  regular  perturbation  to  the 
original  problem.  If  the  perturbation  is  small 
enough,  it  is  to  be  expected  that  all  derivatives 
will  be  close  for  the  two  problems  and  conse¬ 
quently,  the  behavior  of  standard  discrete  vari¬ 
able  methods  will  be  similar  both  with  respect  to 
accuracy  and  stability.  It  is  certainly  true  that 
there  are  open  questions  on  this  issue  needing 
further  investigation;  however,  by  constructing  a 
family  of  neighboring  benchmark  problerns,  it  is 
usually  possible  to  judge  the  size  of  the  neighbor¬ 
hood  in  which  the  convergence  and  accuracy 
properties  are  relatively  invariant  with  respect  to 
the  perturbation.  Several  applications  presented 
herein  provide  strong  evidence  supporting  the 
practicality  of  this  approach. 

In  this  study  we  propose  a  method  to  con¬ 
struct  a  benchmark  problem  that  is  a  close  neigh¬ 
bor  of  a  given  approximate  solution  of  the  origi¬ 
nal  problem.  The  benchmark  problem  is 
constructed  so  that  it  satisfies  exactly  the  differ- 
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ential  equation  but  with  a  known,  usually  small, 
time  varying  forcing  function.  We  can  investigate 
the  global  error/parameter  relationship  of  the 
benchmark  problem  with  the  true  solution  in 
hand.  Under  the  assumption  that  the  original 
problem  is  well-posed  with  respect  to  small  per¬ 
turbations,  we  have  valuable  information  about 
the  optimal  parameters  and  the  accuracy  of  the 
numerical  solution.  Actually  the  stability  as¬ 
sumption  is  not  so  severe  because  any  numerical 
method  needs  it  more  or  less  to  obtain  reliable 
solutions.  Also,  by  introducing  several  neighbor¬ 
ing  approximate  solutions  with  initial  condition 
and  parameter  variations,  then  repeating  the  en¬ 
tire  process,  it  is  possible  to  experimentally  es¬ 
tablish  insight  on  the  size  of  the  region  over 
which  the  convergence  properties  are  invariant. 

Lee  and  Junkins  (1993)  presented  two  com¬ 
puter  codes  for  first  order  and  second  order  sys¬ 
tems  of  differential  equations,  when  the  classical 
Runge-Kutta  fourth  order  method  with  a  fixed 
step  size  was  used.  An  illustrations,  we  show  the 
utility  of  these  codes  for  two  simple  nonstiff 
problems.  When  we  use  the  IMSL  (1989)  subrou¬ 
tines  DIVPRK  and  DIVPBS  as  solvers,  we  show 
the  utility  of  this  methodology  for  two  celestial 
mechanics  problems  (Krogh,  1973)  that  have 
been  used  as  test  problems  several  times  in  the 
literature.  Subroutine  DIVPRK  uses  the  Runge- 
Kutta  formulas  of  order  five  and  six  developed 
by  J.  H.  Vemer.  Subroutine  DIVPBS  uses  the 
Bulirsh-Stoer  extrapolation  method  and  will  ter¬ 
minate  when  impossible  accuracies  are  specified. 
In  the  fifth  example,  we  consider  a  typical  stiff 
problem  and  discuss  some  limitations  and  restric¬ 
tions  of  this  methodology. 


CONSTRUCTION  OF  EXACT 
BENCHMARK  PROBLEMS 

We  want  to  construct  new  differential  equations 
that  are  slightly  perturbed  versions  of  the  original 
differential  equations.  For  these  new  differential 
equations,  we  can  establish  the  true  analytical 
solution  using  an  algebraic  inverse  idea.  Then  we 
can  investigate  the  error/tolerance  relationship 
with  an  absolute  standard.  Under  local  stability 
assumptions,  we  have  valuable  information 
about  the  optimal  parameters  and  the  accuracy  of 
the  particular  numerical  solution  for  the  given 
original  differential  equations.  The  stability  as¬ 
sumption  is  easily  validated  by  constructing 
some  neighboring  benchmark  problems. 


Here  we  introduce  one  way  for  constructing 
exact  benchmark  problems.  We  take  a  global  ap¬ 
proach  for  the  perturbation  term  instead  of  a 
piecewise  polynomial  perturbation  to  avoid  the 
lack  of  smoothness  at  break  points.  First  we  con¬ 
sider  the  following  two  distinct  initial  value  prob¬ 
lems; 

X  =  /i(.v,  /).  x(/o)  =  ^9  over  ... 

X  =  fiix,  X,  t),  xUo)  =  xo,  xUo)  =  xo 

over  to  <  /  s  jy  (2) 

/,:  Rf^  X  R!^  X  R-^ 

A  candidate  discrete  approximate  solution  can  be 
obtained  from  the  original  first  or  second  order 
differential  Eqs.  (1)  and  (2)  using  a  numerical 
method.  We  distinguish  between  first  and  second 
order  systems  because  there  are  certain  draw¬ 
backs  if  one  converts  a  naturally  second  order 
system  into  a  first  order  system.  To  establish  a 
continuous,  differentiable  motion  near  a  given 
approximate  solution,  least  square  approxima¬ 
tion  using  the  discrete  version  of  the  Chebyshev 
polynomials  can  be  invoked  to  obtain  the  solu¬ 
tion  from  the  the  already  discrete  solution  (Abra- 
mowitz  and  Stegun,  1972;  Junkins,  1978).  We 
first  consider  the  least  square  approximation  pro¬ 
cess.  There  are  n  data  points  denoted  as 

XI  —  g(t|),  X2  —  g(h)f  ■  .  .  ,  x„  —  g(t„) 

where  t,-  are  the  values  of  the  equally  spaced  in¬ 
dependent  variable  (/i,  =  (t,-+i  -  /,)  =  constant). 

A  linear  transformation  of  independent  vari¬ 
ables  should  be  made  to  use  discrete  orthogonal¬ 
ity  with  weight  function  u'(/)  =  1, 


where  h,  is  the  constant  increment  of  t, 

X  =  g(r)  =  G(/).  (3) 

From  n  data  points,  the  function  G  can  be  estab¬ 
lished  as  a  linear  combination  of  m  basis  func¬ 
tions  that  form  the  discrete  version  of  the 
Chebyshev  polynomials  as  follows: 

m 

GO)  =  E  atTiO) 

1=  I 
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where  w  <  «  and  W)  is  the  /th  Chebyshev  poly- 
nomial. 

The  Chebyshev  polynomials  are  defined  as 
follows:  If  Urn  ”  =  0,  1,  2,  .  .  .  ,  N)  and 

w{u)  =  1,  then 

n  (n\ln  +  m\  -  m)\ 

With  the  recurrence  relations: 

Uu)  =  1 
Uu)  =  1  -  ^ 

(rt  +  1)(N  -  n)7'„+i(M)  =  {In  -I-  1)(A^  -  2u)T„{u) 

—  n{N  +  n  +  \)T„-\{u). 

Note  that  the  recurrence  relations  make  it  easy 
to  evaluate  an  expansion  in  Chebyshev  polyno¬ 
mials,  and  a  similar  recurrence  makes  it  easy  to 
evaluate  the  derivative  of  the  expansion. 

Using  discrete  orthogonality  of  the  Chebyshev 
polynomials,  the  typical  coefficient  oj  can  be  ob¬ 
tained  as  follows: 

”  2-=,  Tj{i,)Tj{h) 
where  I  ^  j  ^  m. 

We  can  find  g{t)  from  G{t)  because  g{t)  = 
G{t{t)).  Using  the  least  square  approximation, 
we  can  find  the  continuous,  differentiable,  ana¬ 
lytical  solution  x{t)  of  Eq.  (3)  that  interpolates 
the  n  discrete  numerical  solutions  obtained  from 
Eqs.  (1)  and  (2).  Now  this  analytical  expression 
a:(/)  does  not  satisfy  exactly  the  Eqs.  (1)  and  (2). 
However,  substituting  -v(/),  x{t)  into  Eq.  (1)  al¬ 
lows  us  to  determine  an  analytical  function  for 
the  perturbation  term  CiU)  that  appears  in  the 
following  differential  equation: 

i(/)  =  fi{x{t),  t)  +  eM  =  F,(jr,  /).  (4) 

Alternatively,  if  the  system  is  second  order,  then 
substituting  x{t),  x(f),  x(r)  into  Eq.  (2)  allows  us 
to  determine  the  perturbation  term  ^2(0  that  ap¬ 
pears  in  the  following  differential  equation: 

x(/)  =  f2(x(0,  x{t),  t)  +  e2{t)  =  F2{x,  X,  t). 

(5) 

Note  that  because  x{t),  x{t),  x{t)  are  available 
functions,  Fi{x,  t),  Fiix,  x,  t)  are  also  available 


functions  that  satisfy  Eqs.  (4)  and  (5)  exactly, 
and  x(t)  is  a  neighbor  of  the  original  numerical 
solution  {xt ,  X2,  ~  ■  •  5  -V/i}-  By  construction,  the 
functions  e,(0  =  x(t)  -  /i(.v(/),  t)  and  cjlO  = 
x{0  -  fiixU),  x{t),  t)  are  known  analytically  and 
therefore  these  small  forcing  functions  can  be 
computed  exactly  at  all  t.  These  functions  are 
programmed  and  Eqs.  (4)  and  (5)  can  be  solved 
by  numerical  methods  and  the  results  can  be 
compared  to  the  exact  x{t),  xiO-  The  above 
mathematical  procedure  can  be  performed  in  an 
automated  fashion  using  computer  symbol  ma¬ 
nipulation.  The  symbol  manipulation  can  also  au¬ 
tomate  the  generation  of  C  or  FORTRAN  Code 
to  compute  function  e,  (/)  and/or  eiU). 

Now  Eq.  (4)  is  a  benchmark  problem  neigh¬ 
boring  Eq.  (1)  and  we  have  arranged  that  xU), 
m  satisfy  Eq.  (4)  exactly;  and  Eq.  (5)  becomes 
the  benchmark  problem  neighboring  Eq.  (2)  and 
we  have  arranged  that  x{t),  x{t),  x(t)  satisfies  Eq. 
(5)  exactly.  We  obviously  want  the  perturbation 
function  e(t)  to  be  as  small  as  possible,  that  is, 
the  benchmark  problem  is  not  only  a  near  neigh¬ 
bor  of  the  original  discrete  solution,  but  it  also 
very  nearly  satisfies  the  same  differential  equa¬ 
tions.  The  previously  discussed  least  square  ap¬ 
proximation  method  typically  gives  the  poorest 
approximation  near  the  ends  of  the  interval.  This 
may  result  in  a  relatively  large  e(t)  near  the  initial 
and  final  times.  To  avoid  this  problem  we  can 
integrate  Eqs.  (1)  and  (2)  over  the  enlarged  inter¬ 
val  /o-  —  t  —  (/■+  (where  to-  <  to ,  t/-  >  t/)  and  use 
these  numerical  results  as  generators  for  analyti¬ 
cal  solutions  over  the  original  interval  (to  ^  t  ^ 
tf).  Experience  indicates  that  a  20%  enlarge¬ 
ment”  {(r/+  -  to-)  s  1.2(//-  to)}  is  almost  always 
sufficient  to  support  good  interpolation  over  the 
original  interval  (to  —  t  —  tf).  If  the  measure  of 
e{t)  is  judged  too  large  then  we  increase  the  num¬ 
ber  of  Chebyshev  polynomials  m  to  reduce  e(t) 
over  the  whole  interval,  or  ‘  start  over  by  at¬ 
tempting  to  find  a  better  approximate  numerical 
solution  to  initiate  the  process.  Figures  1  and  2 
provide  logical  flow  charts  showing  construction 
of  a  benchmark  problem  and  an  associated  con¬ 
vergence  study  for  second  order  systems. 


ILLUSTRATIVE  EXAMPLES 

Now  we  demonstrate  the  previous  ideas  using 
five  initial  value  problems  for  ordinary  differen¬ 
tial  equations.  First  we  show  the  utility  of  the 
computer  codes  (Lee  and  Junkins,  1993)  for  two 
simple  nonstiff  problems.  Then,  two  celestial  me- 


# 


♦ 
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FIGURE  1  Flow  chart  for  construction  of  a 
benchmark  problem. 


chanics  problems  are  introduced  to  illustrate  the 
utility  of  this  methodology  when  we  use  the 
IMSL  (1989)  subroutines  DIVPRK  and  DIVPBS. 
Finally,  we  consider  a  stiff  problem  in  the  fifth 
example. 

First  Order  Systems 

We  consider  the  following  pair  of  nonlinear  dif¬ 
ferential  equations. 


i  =  2jci  -  lx\X2 
X  =  -X2  +  X\X2 


(6) 


where  xi(0)  =  1  and  X2(0)  =  3,  and  we  seek  the 
solution  over  the  interval  0  s  ^  10. 

First,  we  solve  Eq.  (6)  using  the  Runge-Kutta 
fourth  order  method  to  evaluate  the  candidate 
discrete  approximate  solution.  Here  we  use  121 


data  points  over  the  20%  enlarged  time  interval 
- 1  <  t  <  1 1 .  Second,  we  establish  a  continuous, 
differentiable,  analytical  expression  for  interpo¬ 
lating  xi(/)  and  X2{t)  from  the  discrete  approxi¬ 
mate  solution.  We  use  51  Chebyshev  polynomi¬ 
als  for  fitting.  Finally  we  substitute  ,ri(/).  .v;(/), 
jc,(0,  i(t)  into  Eq.  (6)  and  determine  functions  for 
e,(/)  and  e2{t)  that  satisfy  the  following  equations 
exactly 


ii  =  2xi  -  2x,X2  +  e, 

X2  =  -Xi  +  X\X2  +  e2- 

Now,  Eq.  (7)  provides  a  benchmark  problem 
for  Eq.  (6),  and  JCi(r),  XiU)  are  the  solutions  that 
satisfy  Eq.  (7)  exactly.  Upon  solving  Eq.  (7)  nu¬ 
merically  with  various  values  chosen  for  It,  we 
establish  the  relationship  between  step  size  and 
global  error.  When  we  use  the  pointwise  error  in 
the  root  mean  square  sense.  Fig.  3  shows  the 
relationship  in  log/log  scale.  The  critical  value  h 
is  about  0.0005  and  if  h  decreased  below  0.0005, 
then  the  results  begin  to  deteriorate.  The  rate  of 
convergence  is  4  in  this  problem  and  this  coin¬ 
cides  with  the  fact  that  an  rth  order  method 
should  have  a  global  error  of  O  (/!'')  in  the  absence 
of  arithmetic  errors  (Gear,  1971).  Figure  4  shows 
the  perturbation  terms  over  the  time  interval.  For 
the  benchmark  problem,  the  numerical  results 
are  very  reliable  when  we  use  0.0005  as  It  be¬ 
cause  the  error  measures  are  about  10"'-  while 
the  solutions  for  jr,(r),  X2{t)  vary  from  10"-  to  10“ 
order.  Now  we  turn  our  attention  to  the  original 
problem.  Figure  5  shows  the  relationship  be¬ 
tween  step  size  and  error  at  /  =  10  on  a  log/log 
scale  for  the  original  problem.  Because  we  do  not 
know  the  true  solution,  we  could  follow  the  com¬ 
mon  way  of  assessing  the  accuracy  of  a  family  of 
approximate  solutions  using  the  IMSL  (1989) 
subroutines  DIVPRK  and  DIVPBS.  Comparing 
'  Figs.  3  and  5,  we  notice  that  the  shape  is  roughly 
similar  but,  in  Fig.  5,  the  critical  value  h  is  0.0002 
instead  of  0.0005.  The  reason  for  this  minor  dis¬ 
crepancy  is  the  relatively  large  perturbation 
terms  in  Fig.  4.  If  we  decrease  the  perturbation 
terms  eM  and  €2(0  by  finding  a  higher  order, 
more  accurate  interpolation  and  thereby  make 
the  benchmark  problem  closer  to  the  original  Eq. 
(6),  then  we  can  reduce  this  discrepancy. 


Second  Order  Systems 

We  consider  the  following  nonlinear,  nonautono- 
mous  second  order  differential  equation. 
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FIGURE  2  Flow  chart  for  convergence  study. 


FIGURE  3  Global  error  vs.  step  size  for  the 
benchmark  problem. 
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FIGURE  5  Error  (at  t  =  10)  vs.  step  size  for  the  FIGURE  6  Global  error  vs.  step  size  for  the 
original  problem.  benchmark  problem. 


# 


-X-  0.1(1  +  x'^x  +  O.Ia:^  +  sin  3/  (8) 

where  a:(0)  =  1  and  i(0)  =  0,  and  we  seek  the 
solution  over  the  interval  0  s  r  <  10.  We  convert 
Eq.  (8)  to  a  first  order  system  as  follows: 


.f2  =  — -  0.1(1  +  x\)xi  +  0.1.r]  +  sin  3t 

where  JC|(0)  =  1  and  ^-2(0)  =  0. 

We  solve  Eq.  (9)  using  the  Runge-Kutta 
fourth  order  method  to  evaluate  the  candidate 
discrete  approximate  solution.  Here  we  con¬ 
struct  the  interpolated  solution  using  121  data 
points  over  the  20%  enlarged  time  interval  - 1  ^ 
/  <  11.  An  analytical  expression  for  .V|(t)  is  ob¬ 
tained  from  the  discrete  approximate  solution.  In 
this  problem,  a  degree  30  Chebyshev  polynomial 
is  established  by  the  least  square  approximation. 
Substituting  xi(/),  i|(/),  i:i(/).  into  Eq.  (8)  we  cal¬ 
culate  the  function  e{t)  that  satisfies  the  follow¬ 
ing  equation  exactly. 

X  =  —X  -  0.1(1  +  x^)x  +  O.lx^  +  sin  3/  -I-  e. 

(10) 

To  use  the  Runge-Kutta  method,  Eq.  (10)  can  be 
converted  to  a  first  order  system  as  follows: 


X2  =  -x\  -  0.1(1  +  Jt?)x2  +  0.1x1  -I-  sin  3/  +  e. 

Now,  Eq.  (10)  becomes  a  benchmark  problem 
for  Eq.  (8),  and  xU)  is  an  algebraic  function  that 
satisfies  Eq.  (10)  exactly.  When  we  use  the 
pointwise  error  in  the  root  mean  square  sense. 


Fig.  6  shows  the  relationship  between  global  er¬ 
ror  and  step  size.  The  rate  of  convergence  is  4  as 
expected.  Figure  7  shows  the  perturbation  term 
over  the  time  interval.  The  critical  value  for  step 
size  is  about  0.001.  Now  we  consider  the  original 
problem.  The  relationship  between  step  size  and 
error  at  /  =  10  is  shown  in  Fig.  8  when  we  follow 
the  common  way  assessing  the  true  solution  us¬ 
ing  the  IMSL  (1989)  subroutines  DIVPRK  and 
DIVPBS.  Comparing  Figs.  6  and  8.  we  observe 
that  the  critical  value  h  and  the  accuracy  are  al¬ 
most  the  same. 

We  change  the  initial  conditions  slightly  and 
the  nonautonomous  term  in  the  differential  equa¬ 
tion  as  follows: 

X  =  -r  -  0.1(1  +  x-)x  -1-  O.l.Y^  -f  1.2  sin  3t 

(12) 


where  .y(0)  =  1.2  and  .v(0)  =  0.2  over  the  interval 
0  <  /  <  10. 


FIGURE  7  Perturbation  term  of  example  2. 
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FIGURE  8  Error  (at  /  =  10)  vs.  step  size  for  the 
original  problem. 


After  using  the  same  procedure,  we  obtain  the 
global  error/step  size  relationship  shown  in  Fig. 
9.  We  notice  that  Figs.  6  and  9  are  almost  the 
same.  In  other  words,  the  critical  value  for  h  and 
the  accuracy  are  almost  identical  even  though 
there  are  20%  perturbations  in  the  initial  condi¬ 
tion  and  the  forcing  term  in  the  differential  equa¬ 
tion,  in  this  case. 

Two  Body  Problem 

We  consider  the  simple  two  body  problem.  The 
exact  solution  is  periodic  with  period  2-n  and  the 
solution  traces  out  an  ellipse  with  eccentricity 
0.6. 

=  -xlr\  x(0)  =  0.4,  i(0)  =  0 
j  =  -yh\  y(0)  =  0,  y(0)  =  2 

where  r  =  (x-  -I- 


FIGURE  9  Global  error  vs.  step  size  for  the 
benchmark  problem  of  20%  perturbations. 


FIGURE  10  Absolute  error  vs.  tolerance  for  the 
benchmark  problem  (DIVPRK). 


These  equations  can  be  solved  exactly  (Battin. 
1987);  the  analytical  solution  is  not  included  here 
because  of  space  limitations.  We  reformulate  Eq. 
(13)  as  a  first  order  system  as  follows: 

x\  =  A': 

*  =  +  «>’■"  ,14, 

A3  =  .V4 

X4  =  -Xi/(x]  +  xi)^'- 

where  X|(0)  =  0.4,  .vifO)  =  0,  -X'3(0)  =  0,  .t4(0)  •“  2. 

We  solve  Eq.  (14)  using  DIVPRK  to  evaluate 
the  candidate  discrete  approximate  solution. 
Here  we  use  121  data  points  over  the  20%  en¬ 
larged  time  interval  and  a  degree  50  Chebyshev 
polynomial  approximation  is  used  for  the  least 
square  fitting  of  x\U)  and  .Y3(/).  After  construct¬ 
ing  the  benchmark  problem,  we  do  an  absolute 
error  test  on  (0.  Itt).  Figures  10  and  1 1  show  the 


FIGURE  11  Absolute  error  vs.  tolerance  for  the 
benchmark  problem  (DIVPBS). 
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FIGURE  12  Absolute  error  vs.  tolerance  for  the  two 
body  problem  (DIVPRK). 


relationship  between  absolute  error  and  toler¬ 
ance  in  log/log  scale  when  we  use  DIVPRK  and 
DIVPBS  for  the  benchmark  problem.  Figures  12 
and  13  show  the  relationship  between  absolute 
error  and  tolerance  in  log/log  scale  when  we  use 
DIVPRK  and  DIVPBS  for  the  original  two  body 
problem.  We  notice  that  Figs.  10  and  11  are  al¬ 
most  identical  to  Figs.  12  and  13,  respectively. 
The  perturbation  terms  are  shown  in  Fig.  14.  We 
plot  the  relationship  between  the  number  of  func¬ 
tion  calls  and  the  absolute  error  in  Fig.  15.  Thus 
the  benchmark  problem  (constructed  by  the 
method  of  this  study)  essentially  gives  results 
that  are  identical  to  those  obtained  by  using  the 
exact  solution  of  the  original  problem. 

Euler  Equations  of  Motion 

We  consider  the  Euler  equation  of  motion  for  a 
rigid  body  without  external  forces. 


FIGURE  13  Absolute  error  vs.  tolerance  for  the  two 
body  problem  (DIVPBS). 


FIGURE  14  Perturbation  terms  of  the  two  body 
problem. 


Xi  =  .V2.V3 

=  -O.5I.Y3.V,  (15) 

X3  =  -XtX2 

where  xi(0)  =  0,  .V2(0)  =  1,  X3(0)  =  1. 

The  classical  exact  solutions  of  Eq.  (15)  are 
the  Jacobian  elliptic  functions  (Abramowitz  and 
Stegun,  1972)  as  follows: 

xi  =  sn(t  I  0.51),  .Y2  =  dn{t  1  0.51), 

Xi  =  cnU  I  0.51). 

They  are  periodic  with  a  quarter  period  K  where 
K  =  1.86264  08023  32738  55203  •  •  •  in  this 
case. 

We  solve  Eq.  (15)  using  DIVPRK  to  evaluate 
the  candidate  discrete  approximate  solution.  To 


FIGURE  15  Number  of  function  calls  vs.  absolute 
error. 
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FIGURE  16  Absolute  error  vs.  tolerance  for  the 
benchmark  problem  (DIVPRK). 


FIGURE  18  Absolute  error  vs.  tolerance  for  the 
Euler  equations  (DIVPRK). 


establish  a  benchmark  using  our  method,  we  use 
121  data  points  over  the  20%  enlarged  time  inter¬ 
val  and  determine  a  degree  50  Chebyshev  least 
square  polynomial  approximation  of  X|(/),  XiU), 
and  XiO).  After  constructing  the  benchmark 
problem,  we  do  an  absolute  error  test  on  (0, 4  K). 
Figures  16  and  17  show  the  relationship  between 
absolute  error  and  tolerance  in  log/log  scale 
when  we  use  DIVPRK  and  DIVPBS  for  the 
benchmark  problem.  Figures  18  and  19  show  the 
relationship  between  absolute  error  and  toler¬ 
ance  in  log/log  scale  when  we  use  DIVPRK  and 
DIVPBS  to  solve  Eq.  (15)  and  compare  to  the 
classical  Jacobian  elliptic  function  solution.  We 
notice  that  Figs.  16  and  17  are  almost  identical  to 
Figs.  18  and  19,  respectively.  The  perturbation 
terms  are  shown  in  Fig.  20.  We  plot  the  relation¬ 
ship  between  the  number  of  function  calls  and 
the  absolute  error  in  Fig.  21.  Thus,  again, 
this  example  indicates  that  our  neighboring 


benchmark  problem  leads  to  essentially  identical 
convergence  properties  to  using  the  exact  solu¬ 
tion  of  the  original  problem. 


A  Stiff  Problem 

We  consider  the  following  problem  (Shampine 
and  Gordon,  1975)  that  represents  a  typical  stiff 
problem. 

.V,  =  -29998.V,  -  39996.T2 

(16) 

.Y2  =  14998.5.Y,  -1-  19997.Y2 

where  .V|(0)  =  I.  .Y2(0)  =  1. 

The  exact  solutions  of  Eq.  (16)  are  as  follows: 

.ri(;)  =  7  exp(-10''r)  -  6  exp(-/) 

(17) 

.V2(/)  =  -3.5  exp(-IO-'t)  +  4.5  exp(-/). 


DIVPBS 


5-^ - , - ^ ^ - . - 1 - - - ■ - ' - 1 - ■ - ' - 

-15  ~10  -5  0 

LOG,o(Tolerance) 


FIGURE  17  Absolute  error  vs.  tolerance  for  the  FIGURE  19  Absolute  error  vs.  tolerance  for  the 
benchmark  problem  (DIVPBS).  Euler  equations  (DIVPBS). 
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FIGURE  20  Perturbation  terms  of  the  Euler  equa¬ 
tions. 


The  eigenvalues  of  the  coefficient  matrix  are  --I 
and  Figures  22  and  23  show  the  solutions 
over  two  different  intervals,  a  region  of  very 
rapid  change  followed  by  gradual  asymptotic  be¬ 
havior.  It  is  almost  impossible  to  obtain  a  satis¬ 
factory  orthogonal  function  benchmark  problem 
that  covers  both  regions  with  a  reasonable  num¬ 
ber  of  terms.  We  conclude  that  the  proposed 
methodology  is  not  adequate  for  such  stiff  prob¬ 
lems  unless  piecewise  approximation  methods, 
for  example,  the  type  introduced  by  Junkins  et 
al.  (1973)  are  used.  Stiff  problems  are  relatively 
expensive  to  solve  and  the  expense  depends 
strongly  on  the  tolerance  (Gear,  1971;  Shampine 
and  Gordon,  1975;  Shampine  and  Gear,  1979). 
Enright  et  al.  (1975)  provide  a  good  collection  of 
stiff  test  problems. 


FIGURE  21  Number  of  function  calls  vs.  absolute 
error. 


FIGURE  22  Solution  of  example  5  for  the  rapid 
change  region. 


SUMMARY  AND  CONCLUSION 

The  present  article  introduces  an  inverse  method 
for  constructing  exact  benchmark  problems  for 
initial  value  problems.  This  methodology  gives 
valuable  information  about  the  optimal  tuning  pa¬ 
rameters  and  the  accuracy  of  the  numerical  solu¬ 
tion  for  a  class  of  ordinary  differential  equation 
problems  and  for  a  given  solution  code.  Numeri¬ 
cal  examples  indicate  that  a  rigorous  error  analy¬ 
sis  is  usually  obtained  not  merely  for  one  nominal 
solution,  but  for  a  substantial  neighborhood  of 
the  nominal  solution.  If  one  wants  to  use  the 
classical  Runge-Kutta  method  with  a  fixed  step 
size,  then  the  codes  (Lee  and  Junkins,  1993)  pro¬ 
vide  directly  useful  information  about  the  opti¬ 
mal  step  size  h  and  the  associated  accuracy. 


FIGURE  23  Solution  of  example  5  for  the  gradual 
change  region. 
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More  sophisticated  users  who  are  familiar  with 
adaptive  and  robust  codes  can  also  construct 
similar  benchmark  problems;  however,  the  Che- 
byshev  approximation  method  may  have  to  be 
replaced  or  modified  to  obtain  a  method  not  re¬ 
stricted  to  uniformly  spaced  data.  For  stiff  sys¬ 
tems,  special  purpose  approximations  may  be 
required  in  lieu  of  the  global  Chebyshev  approxi¬ 
mations.  The  analytical  expressions  for  the 
benchmark  problem  and  its  solution  can  be  estab¬ 
lished  using  computer  symbol  manipulation  [e.g., 
MACSYMA  (1988),  Mathematica,  MAPLE, 
etc.].  Then  the  user  investigates  the  global  error/ 
parameter  relationship  and  compares  various 
codes  with  special  case  absolute  standards.  In 
examples  3  and  4,  we  show  the  utility  of  this 
methodology  using  the  IMSL  (1989)  subroutines 
DIVPRK  and  DIVPBS  as  solvers.  And  we  inves¬ 
tigate  the  absolute  error/ tolerance  relationship 
and  compare  DIVPRK  and  DIVPBS.  We  have 
developed  some  basic  methodologies,  but  there 
remains  a  need  for  additional  numerical  experi¬ 
ments  to  further  evaluate  the  practical  utility  of 
this  approach. 
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VALIDATION  OF  FINITE  DIMENSIONAL  APPROXIMATE 
SOLUTIONS  FOR  DYNAMICS  OF 
DISTRIBUTED  PARAMETER  SYSTEMS 

John  L.  Junkins*  and  Sangchul  Lee^ 

An  inverse  dynamics  method  is  introduced  for  constructing  exact  special 
case  solutions  for  hybrid  coordinate  ordinary/partial  syst®"^? 
differential  equations  (hybrid  ODE/PDE  systems),  and  the  utility  of  this 
method  in  validating  numerical  solution  methods  is  explored. 

INTRODUCTION;  Construction  of  Benchmark  Problems  for 
Solution  of  Ordinary  Differential  Equations 

Given  a  flexible  multi-body  dynamical  system,  most  rigorously  described  by  a 
hybrid  system  of  nonlinear  ordinary  and  partial  differential  equations,  we  ° 

validate  simulations  of  the  behavior  of  the  system  by  numerical  met^hods.  With 
most  appUcations  of  approximate  solution  algorithms,  we  must  somehow  evaluate 
the  accuracy  of  a  given  approximate  solution,  without  knowing  the  true  solution 
What  happens  if  we  can  construct  an  exact  forced  response  solution  for  a  special  case 
motion  near(in  a  sense  to  be  established)  a  candidate  approiamate  solution  This 
gives  us  an  absolute  standard  and  promises  the  capability  of  diplaying  e^tly  the 
space/time  distribution  of  solution  errors  for  the  special-case  solution  and  therefore 
suggesting  remedies,  if  needed,  to  improve  the  discretization-based 

^The  idea  is  easily  introduced  by  first  considering  the  imtial  value  problem  for 

nonUnear  ordinary  differential  equations.^  In  general,  we  do 

solution  and  the  numerical  methods  give  us  an  approximate  solution.  The  most 
common  way  of  assessing  the  true  error  of  a  numerical  solution  is  to  reduce  the 

tolerance,  integrate  again,  and  compare  the  results.^-^  'Intel 

error  analyses  can  be  conducted,  there  is  no  general  way  to  absolute^  guarantee 
the  final  accuracy  of  the  solutions.  While  this  does  not  preclude  obtaining  practical 
sltions  for  most  applications,  it  remains  very  difficult  to  answer  subtle  questions 
Actually  the  theory  of  differential  equation  solvers  is  far  from  complete,  so  that 
the  understanding  of  a  given  code’s  performance  invariably  requires  a  study  of 
experimental  results.  Hull,  et  al^  and  Krogh®  provided  two  outstanding  collections 
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of  test  problems  for  this  purpose,  for  the  case  of  ordinary  differential  equations. 
These  test  problems  have  been  used  in  the  development  and  testing  of  the  codes 
and  can  be  regarded  as  standard  benchmark  problems  for  initial  value  problem 

Whenever  we  know  the  true  solution  of  a  test  problem  we  can  investigate 
the  relationship  between  the  true,  or  global  error  and  parameters  of  a  given 
code(e.g.,  step  size,  local  error  tolerance,  order,  etc.).  Of  course,  only  for  a  small 
minority  of  interesting  problems  can  the  initial  value  problem  be  solved  analytically. 
We  introduce  here  an  inverse  method  which  algebraicly  constructs  a  continuous 
neighbor  of  a  given  numerical  approximate  solution;  the  neighboring  continuous 
motion  does  in  fact  exactly  satisfy  the  differential  equations(with  a  known  small 
forcing  function)  and  serves  as  an  excellent  benchmark  problem.  The  remaining  and 
most  critical  question  is:  How  useful  is  the  convergence  and  accuracy  information 
obtained  for  the  benchmark  problem,  as  regards  drawing  conclusions  for  the  original 
problemi  It  is  certainly  true  that  there  are  open  questions  on  this  issue,  however,  by 
constructing  a  family  of  neighboring  benchmark  problems,  it  is  usually  possible  to 
judge  the  size  of  the  neighborhood  in  which  the  convergence  and  accuracy  properties 
are  relatively  invariant  with  respect  to  the  perturbation,  and  thereby  gain  the 
practical  insight  needed  to  proceed  with  confidence  in  a  solution  and  associated 

error  measures.  i  •  i  •  i  i  . 

Now,  we  propose  a  method  to  construct  a  benchmark  problem  which  is  a  closely 

neighboring  trajectory  of  a  given  approximate  solution  of  the  original  problem. 
As  will  be  evident,  the  benchmark  problem  motion  is  constructed  algebraicly  so 
that  it  satisfies  exactly  the  differential  equation  but  with  a  known,  usually  small, 
time  varying  forcing  function.  We  can  then  investigate  the  global  error/parameter 
relationship  of  the  benchmark  problem  with  the  true  solution  in  hand.  Under 
the  assumption  that  the  original  problem  is  well-posed  with  respect  to  small 
perturbations,  we  have  valuable  information  about  the  optimal  parameters  a.nd 
the  accuracy  of  the  numerical  solution.  Through  study  of  a  family  of  neighboring 
benchmark  problems,  we  can  directly  establish  insight  on  the  “stability”  of  this 
error  analysis. 

Initially,  we  restrict  attention  to  nonlinear  ordinary  differential  equation(ODE) 
systems,  we  subsequently  broaden  the  discussion  and  examples  to  consider  hybrid 
differential  equation  systems.  Here  we  introduce  one  way  for  constructing  the  exact 
benchmark  problem.  First  we  consider  the  following  initial  value  problem  for  a 
second  order  ODE  system: 

X  =  f[x,x,t),  x{to)  =  $0)  ^(fo)  =  over  to  <  t  <  tf 

/  :  xR^  xR-^R^ 

Here  we  consider  the  case  where  x  is  a  scalar(i.e.,iV=l).  The  following  approach 
can  be  easily  generalized  for  the  vector  case.  A  candidate  discrete  approximate 
solution  can  be  obtained  from  the  original  second  order  differential  equation  (1) 
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using  a  numerical  method.  To  establish  a  continuous,  differentiable  motion  near  a 
given  approximate  solution,  we  use  a  least  square  approximation  based  upon  the 
discrete  version  of  the  Chebyshev  polynomials;  this  polynomial  approximation  can 
be  established  directly  from  the  discrete  approximate  solution.®-'^  We  first  ^nsider 
the  least  square  process.  There  are  n  data  points  such  as  xj  =  5(<i),  aiz  -  gih), 
_  g(^tn)  where  U  are  the  equally  spaced  values  of  the  independent 

variable(/it  =  (ti+i  —  ti)  =  constant).  ,  -  .  j  *  •  i,  1,1 

A  linear  transformation  to  nondimensionalize  the  independent  variable  should 
be  made  to  use  the  discrete  version  of  the  Chebyshev  polynormals. 


m  = 


t-ti 


ht 


where  hi  is  the  constant  increment  of  t. 

X  =  g{t)  =  G{i) 

From  n  data  points,  the  least  square  polynomial  approximation  function  G  can  be 
established  by  a  linear  combination  of  m  basis  functions;  we  use  the  discrete  version 
of  the  Chebyshev  polynomials'^  with  weight  function  w{t)  =  1  as  follows; 

m 

OtTi(f) 

t=i 

where  m  <  n  and  the  Ti{t)  are  the  discretely  orthogonal  Chebyshev  polynomials. 

The  Chebyshev  polynomials  are  defined  as  follows: 

If  =m  (m  =  0,l,2,---,Ar)  and  111(11)  =  1,  then 


Tn{u)  = 

Tn=0 


u\{N-m)l 
(it  —  m)!  N\ 


with  the  recurrence  relationships: 


To{u)  =  1 

2u 

(n  +  1){N  -  n)Tn+i(it)  =  (2n  +  l)(Ar  -  2u)T„{u)  -  n{N  +  n  +  l)T„_i(ii) 

Using  the  discrete  orthogonality  property  of  the  Chebyshev  polynoimals'^,  coefficient 
aj  can  be  obtained  as  follows: 


■  Er=i  mmu) 


# 


# 
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where  1  <  j  <m.  Since  no  matrix  inverse  is  required,  and  owing  to  the  completeness 
of  these  polynomials,  it  is  well  known  that  most  smooth  functions  can  usually  be 
approximated  accurately  using  a  modest  degree  (n).^ 

We  can  find  g{t)  from  G(t),  since  g{t)  =  G{t{t)).  Using  this  least  square 
approximation,  we  can  find  a  continuous,  differentiable,  analytical  solution  Xb{t) 
which  interpolates  or  lies  very  near  the  given  n  discrete  numerical  x,  approximate 
solutions  of  Eq.(l).  Of  course  this  analytical  expression  Xb{t)  does  not  satisfy 
exactly  the  Eq.(l).  However,  substituting  Xb{t),  Xb{t),  Xb(t)  into  the  equation 
e(t)  =  x(t)  -  fix{t),x{t),i)  allows  us  to  determine  an  analytical  function  for  the 
perturbation  term  e{t)  which  appears  in  the  following  differential  equation: 

x(t)  =  f{x{t),x{t),t)  +  e(t)  =  F{x,x,t)  (2) 

Since  f{x{t),x{t),i)  is  given  and  e(t)  is  an  available  algebraic  function,  F{x,x,t)  is 
available.  Now  Xb{t)  satisfies  Eq.(2)  exactly,  and  finally,  this  known  function  Xb{t) 
is  a  neighbor  of  the  original  numerical  solution  {xi,  X2,  •••,  in}-  By  algebraic 
construction  the  function  e{t)  =  Xb{t)  -  f{xb{t),ibit),t)  is  known  analytically 
and  therefore  we  know  this  small  forcing  function  at  all  t,  and  obviously,  we 
know  “how  small”  e(t)  is.  This  function  is  programmed  and  Eq.(2)  can  then  be 
solved  by  numerical  methods  and  the  resxilts  can  be  compared  to  the  known  exact 
Xb{t),  Xb{t).  The  above  mathematical  procedure  can  be  performed  successfully  using 
computer  symbol  manipulation®,  this  is  especially  important  for  the  generalizations 
to  consider  hybrid  differential  equations.  Now  Eq.(2)  is  a  benchmark  problem 
of  Eq.(l)  and  Xb{t),  Xb{i),  satisfy  Eq.(2)  exactly.  We  obviously  want  the 

perturbation  function  e(i)  to  be  as  small  as  possible,  i.e.,  the  benchmark  problem 
is  not  only  a  near  neighbor  of  the  original  discrete  solution,  but  it  also  very  nearly 

satisfies  the  given  differential  equations. 

The  previous  least  square  approximation  method  has  often  been  found  to  give 
poor  results  near  the  ends  of  the  interval.  This  poor  fit  may  cause  a  relatively  large 
e(<)  near  the  initial  and  final  times.  To  avoid  this  problem  we  integrate  Eq.(l) 
over  the  enlarged  interval  to-  <t<tf+  (where  to-  <  to,  tf+  >  if)  and  use  these 
numerical  results  as  generators  for  analytical  solutions  over  the  original  interval 
(to  <  t  <  if).  Experience  indicates  that  a  20%  “enlargement” {(t/+  -  to-)  > 
1.2(t/  —  to)}  is  almost  edways  sufficient  to  support  good  interpolation  over  the 
original  interval  (to  <  t  <  t/).  If  the  measure  of  e(t)  is  judged  too  large  then  we 
increase  the  number  of  Chebyshev  polynomials  m  to  reduce  e(t)  over  the  whole 
interval,  or  “start  over”  by  attempting  to  find  a  better  approximate  numerical 
solution  to  initiate  the  process.  Figures  1  and  2  provide  logical  flow  charts  showing 
construction  of  a  benchmark  problem  and  associated  convergence  study. 

Now  we  demonstrate  the  idea  using  a  simple  nonstiff  problem.  We  use  the 
Runge-Kutta  4th  order  method  with  fixed  step  size,  therefore  we  have  the  most 
common  case  that  the  integration  control  parameter  is  simply  the  step  size  h.  The 
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relationship  between  step  size  h  and  the  global,  or  true  errors  gives  us  the  infomation 
about  the  critical  value  for  h  and  the  accuracy  of  the  numerical  solution.  We 
consider  the  following  nonlinear,  nonautonomous  second  order  differential  equation. 

i  = -a;  -  0.1(1  +  a;^)i  +  0.1®^  +  (3) 

where  ®(0)  =  1  and  i(0)  =  0,  and  we  seek  the  solution  over  the  interval  0  <  t  <  10. 
We  convert  Eq.(3)  to  a  first  order  system  as  follows; 


Xi  =  X2 

=  -Xi  -  0.1(1  +  ®?)®2  +  O.lx?  +  Sin3t 


(4) 


where  ®i(0)  =  1  and  ®2(0)  —  0.  i  i. 

First,  we  solve  Eqs.(4)  using  the  Runge-Kutta  4th  order  method  to  evaluate  the 

candidate  discrete  approximate  solution.  Here  we  use  121  data  points  over  the  20% 
enlarged  time  interval  -1  <  f  <  11.  Second,  we  establish  a  continuous,  differen¬ 
tiable,  analytical  expression  for  interpolating  ®ii(f)  from  the  discrete  approximate 
solution  ®i(<).  We  use  a  degree  30  Chebyshev  polynomial  approximation  for  the 
least  square  fitting.  Finally  we  substitute  ®u(t),  ii6(<),  Eq.(3)  and 

symbolically  determine  the  function  e{t)  which  appears  in  the  following  equation. 

x  =  -X-  0.1(1  +  x^)x  -f  0.1®^  -t-  sinSt  -f  e  (5) 

To  use  the  Runge-Kutta  method,  Eq.(5)  can  be  converted  to  a  first  order  system 
as  follows: 

®i  =  ®2  (6) 

-  0.1(1  -I-  x\)x2  -f-  0.1®?  +  sin3t  +  e 


Now,  Eq.(5)  serves  as  a  benchmark  problem  for  Eq.(3),  because  we  know 
functions  ®b(t)  and  e(t)  which  satisfy  Eq.(5)  exactly.  Upon  solving  Eqs.(6) 
numerically  with  various  values  chosen  for  fi,  and  using  the  benchmark  imti 
state  as  initial  conditions  {®i(0)  =  ®6(0),  ®2(0)  =  ®6(0)},  we  can  establish  the 
relationship  between  step  size  and  global  error.  When  we  use  the  poi^twise  error 
in  the  root  mean  square  sense,  we  are  led  to  the  results  in  Fig.3  which  shows  the 
global  error/step  size  relationship  on  a  log/log  scale.  The  rate  of  convergence  on 
a  log/log  scale  is  4  in  this  problem;  this  coincides  with  the  fact  that  an  rth  order 
method  should  have  a  global  error  of  in  the  absence  of  arithmetic  errors 

The  critical  value  for  step  size  is  about  0.001;  if  h  decreased  below  0  001,  then  the 
results  deteriorate  due  to  the  round-off  error.  The  exact  solution  of  this  benchmark 
problem  and  simulation  errors  are  shown  in  Figs.5  and  6.  To  study  the  robustness  of 
the  convergence  characteristics  of  Fig.3,  we  introduce  relatively  large  perturbations 
in  the  initial  conditions  and  the  nonautonomous  term  in  the  differential  equation 
as  follows: 
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(7) 


x  =  -x-  0.1(1  +  x^)x  +  O.lx^  +  1.2sinM 

where  x(0)  =  1.2  and  x(0)  =  0.2  over  the  interval  0  <<<  10. 

After  using  the  same  procedure  to  vary  the  step  size  and  therefrom  we 
obtain  the  global  error/step  size  relationship  shown  in  Fig.4.  Notice  that  Fig.3 
and  Fig.4  are  almost  identical.  In  other  words,  both  the  critical  va.lue  h  and 
the  associated  accuracy  are  essentially  unchanged,  even  though  we  introduced 
larKe(20%)  perturbations  in  the  initial  conditions  and  in  the  forcing  term  of  the 
differential  equation.  Obviously  these  results  are  problem  dependent,  but  a  similar 
process  will  provide  the  needed  insight  for  other  problems. 

Now  we  apply  this  idea  to  an  idealized  three-body  distributed  parameter 
system.  The  main  difference  is  that  there  are  two  independent  variables  for  space 
and  time.  Therefore,  the  least  square  approximation  method  must  be  generalized 
to  deal  with  two  independent  variables.  In  order  to  obtain  an  approximate 
candidate  discrete  solution,  we  use  linear  quadratic  regulator(LQR)  to  design 
control  forces  and  we  use  the  finite  element  approach  for  space  discretization.  From 
this  approximate  solution,  we  construct  a  smooth,  differentiable,  analytical  solution 
which  is  physically  meaningful.  We  investigate  the  exact  space/time  distribu  ion 
of  errors  of  the  numerical  simulation  using  Newmark  method  with  finite  element 
modeling. 

A  THREE-BODY  DISTRIBUTED  PARAMETER  SYSTEM 
Now  we  demonstrate  the  idea  on  an  idealized  three-body  distributed  parameter 
system.  With  reference  to  Fig.7,  we  consider  a  rigid  hub  with  a  cantilevered  flexi  e 
appendage  which  has  a  finite  tip  mass.  Table  1  summarizes  the  configuration 
parameters  of  this  flexible  structure. 

Table  1  Configuration  Parameters  of  a  Three-Body  Problem 


PARAMETER 

SYMBOL 

VALUE 

Hub  radius 

r 

1  ft 

Rotary  inertia  of  hub 

Jh 

8slug-ft^ 

Mass  density  of  beam 

p 

0.0271875  slug/ft 

Elastic  modulus  of  beam 

E 

0.1584x10^“  Ib/ft^ 

Beam  length 

L 

4.0  ft 

Moment  of  inertia  of  beam 

I 

0.4709502797x10-^  ft^ 

Tip  mass 

mt 

0.156941  slug 

Rotary  inertia  of  tip  mass 

Jt 

0.0018  slug-ft^ 

The  appendage  is  considered  to  be  a  uniform  flexible  beam  and  we  make 
the  Euler-Bernoulli  assumptions  of  negligible  shear  deformation  and  negligible 
distributed  rotatory  inertia.  The  beam  is  cantilevered  rigidly  to  the  hub.  Motion  is 
restricted  to  the  horizontal  plane  and  we  neglect  the  velocity  component  -y6,  that 
is  perpendicular  to  the  y  direction.  The  control  system  is  assumed  to  generate  a 
torque  u  acting  upon  the  hub,  a  torque  uup  and  a  force  ftip  acting  upon  the  tip 

mass,  and  a  distributed  force  density  /  acting  upon  the  appendage.  We  assume 
small  elastic  motions  viewed  from  the  hub-fixed  rotating  reference  frame.  Overdots 
denote  derivatives  with  respect  to  time  and  primes  denote  derivatives  with  respect 
to  the  spatial  position. 

The  kinetic  and  potential  energies  of  this  hybrid  system  are  as  follows; 

2T  =  Jh6^  +  f  [p{y  +  {x+r)0y]dx  +  mt{y{L)  +  {r  +  L)6y +Jt{0+y'{L)y{S) 
Jo 

2V=  l\EI{y"f}dx  (9) 

Jo 

The  nonconservative  virtual  work  of  this  system  is  given  by 


8Wn 


I^L 

=  {u+  f{x){x +T)dx  ^{L-\-T)ftip+utip}S6 

Jo 

+  /  f{x)Sy  dx  -h  ftip6y{L)  +  utipSy'{L) 

Jo 


(10) 


Using  an  explicit  version  of  the  classical  Lagrange’s  equation  for  hybrid 
coordinate  distributed  parameter  systems^®,  the  governing  differential  equations 
and  the  boundary  conditions  are  obtained  efficiently. 

+  p[x  +  r){y  +  {x+r)e)dx  +  Tnt{L  +  r)(^L  +  r)e  +  y{L'^  +Jti0  +  y'{E)) 


=  u  +  /  +T)dx  +  {L  +  r)ftip  +  utip 

Jo 

p{y  +  (2:  -t-  r)6}  -f  Ely""  =  / 


El 


d^y 

dx^ 


—mt{{L  *f  7')^  +  y{L)}  +  ftip  *-  0 


(11) 

(12) 

(13) 


El 


d^y 

dx^ 


+Jt{e+y'{L)}-uup  =  0 

L 


(14) 


Notice  that  if  we  knew  an  explicit,  differentiable  solution  for  the  motion 
variables  {a/(x,t),^(t)},  then  the  Eqs.(ll-14)  can  be  solved  directly  and  ex¬ 
actly  for  the  four  corresponding  time  and  space  varying  forces  and  moments 
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{u{t),f{x,t),utip{i),fup{t)}  thus  yielding  the  desired  inverse  solution.  Since  we 
are  interested  in  physically  meaningful  problems,  we  do  not  wish  to  randomly  guess 
the  solution  Motivated  by  the  above  results  for  ODEs,  we  will  con¬ 

struct  an  exact  solution  which  is  a  near  neighbor  of  a  given  approximate  solution. 
First  we  consider  a  conventional  path  to  construct  the  approximate  solution. 

FINITE  ELEMENT  APPROACH 

Using  the  FEM,  the  partial  differential  equations  of  the  motion  are  transformed 
into  an  approximate  set  of  second-order  differential  equations  in  terms  of  the 
displacements,  velocities,  and  accelerations  of  the  finite  element  coordinates,  and 
the  external  forcing  functions.  Several  finite  element  models  for  a  flexible  arm 
are  presented  in  Refs.[ll)  and  [12].  In  this  section,  we  will  develop  a  finite  element 
model  for  a  hub  with  an  appendage  and  a  tip  mass  by  using  the  extended  Hamilton’s 
principle  that  provides  a  variational  weak  form  for  the  finite  element  model.  It  is 
significant  to  note  that  we  carefully  introduce  the  finite  element  approximations  in 
such  a  way  that  large  hub  rotations  are  admitted;  the  FEM  represents  small  elastic 
displacements  with  respect  to  hub-fixed  axis. 

The  application  of  the  extended  Hamilton’s  principle  yields 


[  \6T~6V +  SWnc)dt  =  0,  6e  =  8y  =  0  at  t  =  ti,t2  (15) 

Jti 

Substituting  Eqs.(8-10)  into  Eq.(15)  and  integrating  by  parts  gives 

£■  [ [HV  H-  (.  +  rm  +  -fh]  ^ 

+  ly*  yo(x  +  r) +  (x  -f  r)6^dx  +  JhO  +  mt{L  +  r) (y(L)  +  (L  + 

I  +  +  r)dx  +  {L  +  r)ftip  + 

(t/(L)  +  (L  + 


dt  =  0 


(16) 

The  displacement  y{x,t)  can  be  discretized  using  a  finite  element  expansion 


13,14 


(17) 


i=l 
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where  transverse  deflection  and  rotation  at  the  left  (right) 

end  of  the  element,  and  are  the  Hermite  cubic  polynomial  shape  functions 

which  satisfy  the  conditions  for  the  admissibility  and  that  are  defined  over  the 
finite  element. 

The  acceleration  and  curvature  are  expressed  as  follows: 


y{x,t)  = 

t=l 


d'^y 

dx^ 


1=1 


The  following  cubic  functions  are  adopted  as  the  shape  functions  for  i-th  finite 
element^"^ 


=  1  -  3xi  +  2x?,  ^>2  =  hxi  -  2hxj  +  hx\ 

ipi  =3xi  -2xi,  ij}4  = -hxi  +  hxi ,  xi  =  {x-xi)fh 

where  Xi  is  the  distance  from  the  root  of  the  appendage  to  the  left  end  of  the  i- 
th  finite  element,  and  h  is  the  length  of  the  finite  element.  These  are  the  most 
commonly  used  shape  functions  for  one-dimensional  beam  elements. 

Substitution  of  Eqs.(17-19)  into  Eq.(16)  and  carrying  out  the  spatial  integra¬ 
tions  yield  the  global  mass,  stiffness  and  forcing  matrices.  After  some  algebra,  the 
assembled  matrix  differential  equation  is  as  follows: 

'JK  +  Me9  Me.]  .  [O  ^  1 

M.e  M..\  \kj  [O  \i/J 

So  /(®)(®  +  ’’) 

Jo"  +  St 

So  Sh  /(®)'^2 

(20) 

where  u  is  the  coordinate  which  consists  of  the  transverse  deflection  and  rotation 
at  each  node  of  the  appendage,  and  the  matrix  elements  of  Eq.(20)  are  presented 
in  the  Appendix. 


1  {r  +  L)  1 
0  0  0 


1 

0 


CONSTRUCTION  OF  A  CANDIDATE  DISCRETE  SOLUTION 


We  can  find  a  physically  meaningful  approximate  solution  by  using  any  given 
approximate  forward  solution  process.  For  simplicity,  we  assume  that  only  the  hub 
torque  u{t)  is  non  zero.  Then  Eq.(20)  can  be  written  in  a  linear  second  order  matrix 

form  as  follows;  ,  , 


Mx  +  Ax  = 


1 

0 


U 


(21) 


where  ^  . 

We  design  a  typical  control  law  using  the  linear  quadratic  regulator(LQR),  and 
modal  coordinates  are  used  to  design  controller.  To  perform  the  modal  coordinate 
transformation,  the  following  open-loop  eigenvalue  problem  should  be  solved  first 


K^.  =  XiM^.  t  =  l,2,---,n  (22) 

with  the  normalization  equation 

=  1  i  =  l,2,---,n  (23) 

We  introduce  the  modal  matrix 

The  general  modal  coordinate  transformation  is  then 

x(t)  =  ^^(t)  (25) 

where  T]{t)  is  the  n  x  1  vector  of  modal  coordinates. 

The  transformed  equation  of  motion  becomes 

Miy  -I-  Kti  =  Du  (26) 


where 

M  =  =  1,  k  =  - 


D  = 


1 

0 


Note  that  diagonal  zero  in  K  corresponds  to  the  rigid  body  mode.  For  control 
applications  the  system  dynamics  are  usually  modeled  as  first  order  state  space 
differential  equations.  We  introduce  the  “2n”  dimensional  modal  state  vector 


Eq.(26)  can  be  written  as  the  first  order  system 

i  =  Az  +  Bu 


(28) 


where 


A  = 


1 

o 

B  = 

q  ■ 

[-K  0^ 

b 

We  adopted  the  following  performance  index  for  the  LQR  control  design: 

|»00 

J  =  f  (z^Qz  +  vFRu)  dt 
Jo 


(29) 


with 


Q  = 


a  0 

0  In 


R  =  1 


where 

The  above  performance  index  is  an  energy  type,  since  the  first  term  and  second 
term  in  the  performance  index  corresponds  to  the  state  energy  and  the  control 

energy  respectively.  i  •  j 

By  solving  the  Riccati  equation^®,  the  optimal  feedback  control  is  obtained 

u  =  —gz  (^*^) 

Now  we  can  solve  the  initial  value  problem  using  a  time  discretization  pro- 
cess(e.g.  Runge-Kutta)  and  through  Eqs.(17,25,30)  we  obtain  y{xi,ti),  ^(ti)  and 
at  discrete  points  in  space  and  time.  The  approximate  motion  {y{xi,  U),  0(<i)} 
corresponds  to  the  system  response  to  a  hub  torque  designed  to  maneuver  the  sys¬ 
tem  and  arrest  vibration. 

CONSTRUCTION  OF  A  BENCHMARK  PROBLEM 

We  want  to  construct  a  continuous,  differentiable,  analytical  solution  that  has 
physical  meaning.  A  candidate  discrete  approximate  solution  for  the  hybrid  system 
can  be  obtained  using  any  given  approximate  forward  solution  process  and  a  given 
controller.  This  approximate  solution  can  be  used  as  a  generator  for  a  near  y 
smooth  space/time  motion  for  which  we  can  determine  the  exact  forces(required  to 
be  consistent  with  this  prescribed  motion  and  the  exact  equations  of  motion).  Least 
square  approximation  associated  with  using  the  discrete  version  of  the  Chebyshev 
polynomials  can  be  invoked  to  obtain  the  smooth  motion  f{x,y)  solution  from 
the  discrete  solution.  While  we  invoke  a  least  square  approximation  to  construct 
the  smooth  f{x,y)  from  an  already  approximate  discrete  solution,  we  subsequently 
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determine  the  modified  forces  to  be  exactly  consistent  with  this  motion  f{x,y).  We 
first  consider  the  least  square  process. 

There  are  n'  X  m'  discrete  data  points  such  as 

zu  =  ^12  =  *‘*5  ^im>  =  /(a^i,t/mO 

Z2I  =  ^22  =  f{^2^y2)^  •••1  ^2m^  =  f{^2^ym*) 

Zn*l  =  ^n'2  =  f{^n*^y2)^  '•*7  ^n'm'  =  /(a^n' j  2/mO 

where  Xj,  yj  are  equally  spaced  independent  variables. 

How  can  we  reliably  compute  a  continuous,  differentiable,  analytical  function 
/  from  the  data  points  in  the  least  square  sense?  Analogous  to  the  ODE  case,  we 
elect  to  make  use  of  discrete  orthogonality.  We  nondimensionalize  (x,y)  using 


x{x)  = 


y{y)  = 


y-yi 

hy 


where  hx^  hy  are  the  increments  of  x  and  y  respectively. 

=  fi^^y)  =  P'i^^y) 

From  two-dimensional  71^  xtti^  data  points,  the  function  F  can  be  approximated 
by  p  X  q  two-dimensional  basis  functions  that  come  from  the  discrete  version  of  the 
Chebyshev  polynomials  [weight  function  it7(a;)  =  1]  as  follows: 

F(x,y)s'£J^biiT,(x)Ti(y) 

1=1  i=i 

where  p  <  n',  q  <  m'  and  T,{*)  is  the  univariate  Chebyshev  polynomial  in  the 
discrete  range. 

We  use  the  previous  definition  of  Chebyshev  polynomials  and  the  recurrence 
relation.  Using  discrete  orthogonality  properties  of  Chebyshev  polynomials,  the 
typical  coefficient  brs  can  be  obtained  as  follows: 

Ei,  eS. 

”  E£i  Eri,  T,(£i)r,(w)r.(xi)r,(Sj) 

where  l^r<p,  1  ^  s  < 

We  can  find  /(a;,y)  from  F{x,y),  since  f{x,y)  =  F{x{x),y{y)). 

Using  the  previous  method  associated  with  the  Chebyshev  polynomials,  we 
interpolate  a  smooth  differentiable  function  yb{x,t)  as  a  two- variable  orthogonal 
function  expansion  which  passes  near  the  y{xi,ii)  points.  Similary,  we  can  interpo¬ 
late  a  smooth  differentiable  function  6b{t)  from  6{ti)  data  points.  Since  yb{x,t)  and 


Obit)  are  smooth,  differentiable  functions,  we  can  force  them  to  be  exact  solutions 
of  our  dynamical  model  by  simply  substituting  yb{x,t),  and  their  space/time 
derivatives  into  Eqs.(ll-14)  and  solving  the  four  equations  analytically  for  four  new 
forces  which  satisfy  these  equations  exactly.  Com¬ 

puter  symbol  manipulation  makes  this  process  possible. 

SIMULATED  RESULTS 

First  we  find  a  candidate  discrete  solution  for  the  enlarged  time  interval 
<  f  <  2)  with  initial  conditions  ^(—1)  =  O.lrad  and  y(a:,— 1)  =  0  for  all  x.  We 
use  LQR  to  design  control  force  u{t)  and  use  the  finite  element  approach  for  space 
discretization.  Here  we  use  1  for  q  of  Eq.(29)  and  use  the  configuration  parameters 
as  shown  Table  1.  Then  we  construct  a  benchmark  problem  for  time  interval 

(0  <  <  <  2).  Figures  8-13  show  yb{x,t),  6b{i),  “(0)  fnpi^) 

which  satisfy  Eqs.(ll-14)  exactly.  Note  that  even  though  we  use  the  enlarged  time 
interval  and  have  good  interpolations  for  dbit)  and  yb{x,t)  near  the  boundary,  there 
exists  relatively  large  error  for  control  forces,  near  the  boundary,  compared  to  the 
nonlinear  ODE  cases.  This  is  due  to  the  fact  that  we  have  two  independent  variables, 
time  and  space,  and  have  coupling  terms  which  are  time  and  space  derivatives  of 
yb(x,t)  in  the  evaluation  of  control  forces.  In  contrast  to  enlarging  the  time  interval 
for  ODE  problems,  it  is  neither  physically  nor  mathematically  meaningful  to  enlarge 
the  spatial  domain.  As  will  be  evident,  this  is  a  minor  problem,  and  does  not  prevent 
us  from  establishing  “exact”  benchmark  problems. 

Finite  element  approach  gives  us  Eq.(20)  and  for  simulation  we  use  step- 
by-step  solution  using  Newmark  integration  method.  Given  initial  conditions 
{2/(®,0)  =  2/6(a:,0),  0(0)  =  06(0)}  and  force  functions  {«(<),  f{x,i),  ftip{t)}, 

the  approximate  simulation  of  this  structure’s  dynamics  {ys{x,t),  0,(t)}  can  pro- 
ceed.  Figure  14  shows  the  space/time  error  distribution  ey{x^t)  =  t)  — 
when  we  use  20  finite  elements  and  0.002  sec.  for  step  size. 

Second  we  find  a  candidate  solution  for  the  enlarged  time  interval  (0  <  t  <  0.1). 
Initial  condition  for  0  is  O.lrad  and  the  third  natural  mode  of  this  fle^^ble  structure 
is  used  for  y{x,0).  We  use  LQR  to  design  control  force  u{t)  and  FEM  is  used  for 
sapce  discretization.  Here  we  use  100  for  q  of  Eq.(29)  and  use  the  configuration 
parameters  as  shown  Table  1  except  mt  and  Jt  (mt=0.256941,  Jt=0.0028).  Then 
we  construct  a  benchmark  problem  for  time  interval  (0  <  t  <  0.08),  i.e.,  we  have 
new  set  t/t(x,<),  06(<),  and  {n(t),  /(x,t),  uup{t),  fup{t)}  which  satisfy  Eqs.(ll-14) 
cxsictly 

Now  we  can  investigate  the  convergence  errors  in  a  family  of  approximate 
solutions  with  special  case  absolute  standards.  When  we  use  the  Newmark 
integration  method  with  finite  element  modeUng,  the  convergence  and  accuracy 
behavior  is  studied  as  a  function  of  the  number  of  finite  elements  and  the  integration 
step  size.  Figure  15  shows  the  error  norm  ||e(,]|  and  ||ej,ll  for  various  mesh  sizes  for 
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a  fixed  integration  step  size  on  a  log/log  scale.  Figure  16  shows  the  error  norm 
lle^ll  and  ||eyll  for  various  integration  step  sizes  for  a  fixed  number  of  finite  elements 
on  a  log/log  scale.  The  error  norm  distribution  of  9  and  y  is  shown  m  Figs.l7,  18 
respectively,  as  a  function  of  DT(time  step  size)  and  H(mesh  size). 

Here  we  introduce  the  following  definitions  for  the  supmetric  error. 

lleo(t)llL=(o,T)  =  ee{tfdt^ 

lle3,(x, OlUno, T;r->)  =  ^  ey{x,tfdxdt^ 


where  efl(t)  =  ^s(f)  ~  ^fi(f)- 

The  relative  errors  are  defined  as  follows: 


Moyw)  HE 


ll3/(®)*)llt’(0,T;L’) 


We  observe  that  the  rate  of  convergence  is  2  in  /lt(decrease  DT  to  reduce  error 
measure)  and  4  in  ^(decrease  H  to  reduce  error  measure)  from  Figs.15  and  16,  except 
for  the  small(/lt,  h)  region  where  arithmetic  errors  dominate  and  provide  computer 
limitations  to  accaracy.  It  is  tins  latter  insight  that  is  essentially 
obtain  by  pre-existing  methods,  but  is  easily  estabbshed  by  the  methods  ol  his 
paper.  We  should  be  careful  in  saying  that  adjusting  h  (to  achieve  accuracy)  is  less 
expensive  than  adjusting  At,  because  the  rate  of  convergence  of  4  in  h  and  the  rate 
of  convergence  of  2  in  At  does  not  guarantee  this  fact.  Each  approach  to  improving 
accuracy  results  in  different  amount  of  computational  load,  which  depends  on  the 
specific  program.  From  Figs.15-18,  we  can  also  notice  that  if  H  is  too  crude  then 
At  reduction  does  not  improve  the  solution  and  if  DT  is  too  big  then  /i  reduction 
does  not  improve  the  solution.  The  numerical  results  indicate  that  the  minimum 
value  of  REe  is  0.7  x  IQ-’  (when  H=0.2  and  DT=0.00002)  and  the  minimum  value 
of  REy  is  0.3  X  10-®  (when  H=0.4  and  DT=0.00005).  We  know  of  no  method  that 
could  give  this  insight  before  the  introduction  of  the  present  method. 

We  construct  a  neighboring  benchmark  problem  to  investigate  the  robustness  o 
the  convergence  characteristics  of  Figs.15-18.  To  construct  a  neighboring  benchmark 
problem,  first  we  find  a  candidate  discrete  solution  with  the  following 
condition  and  forcing  function  u(t).  Comparing  to  the  previous  case,  we  make  a  10% 
increase  of  the  initial  condition  y(a:,  0)  and  arbitrarily  add  a  sinusoidal  perturbation 
term  0.4186sm(27rt/0.08)  to  the  previous  hub  control  u[i)  for  a  new  perturbed  hu 
control.  The  error  norm  distributions  of  the  perturbed  case  are  almost  identical 
to  the  previous  problem.  So  we  can  conclude  that  the  convergence  and  accuracy 
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properties  of  this  approximate  solution  process  are  indeed  relatively  invariant  in  the 
presence  of  these  finite  perturbations,  in  this  case. 

SUMMARY  AND  CONCLUSION 

The  present  paper  introduces  an  inverse  dynamic  method  for  constructing  exact 
special  case  solutions  for  hybrid  ODE/PDE  systems.  A  multi- variable  orthogonal 
function  expansion  method  and  computer  symbol  manipulation  are  successfully  used 
for  this  process.  The  hybrid  ODE/PDE  systems  with  exact  solutions  can  serve  as  a 
benchmark  problem  to  validate  approximate  solution  methods.  This  methodology 
makes  it  possible  for  one  to  rigorously  determine  exact  solution  errors  and  to  study 
the  convergence  and  accuracy  behavior  as  a  function  of  tuning  parameters  for 
a  class  of  ODE/PDE  systems  for  which  the  initial  value  problem  is  not  exactly 
solvable.  Numerical  examples  indicate  that  a  rigorous  error  analysis  is  obtained  not 
merely  for  one  nominal  solution,  but  for  a  substantial  neighborhood  of  the  nominal 
solution.  By  constructing  a  family  of  neighboring  benchmark  problems,  one  can 
obtain  valuable  information  about  the  convergence  and  accuracy  properties  that 
are  relatively  invariant  with  respect  to  perturbations  within  a  known  bound. 
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appendix 

Submatrix  Elements  of  Finite  Element  Method 
The  local  mass  and  stiffness  matrices  of  the  i-th  element  of  the  appendage  is  defined 
as  follows; 
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where  Xi  is  the  distance  from  the  root  of  the  appendap  to  the  left  end  of  the  t-th 
finite  element,  r  is  the  radius  of  the  hub,  and  h  is  the  length  of  the  finite  elemen  . 
The  matrix  due  to  the  tip  mass  is  defined  as  follows: 
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Now,  the  submatrices  in  Eq.(20)  can  be  defined  as  follows. 
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where  N  is  the  number  of  finite  elements. 
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Fig.  13  Force  applied  at  tip  fupit) 
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Fig.  1  Dual  robot  cooperative  manipulation  example. 
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la.  Governing  Equations 

For  natural  systems,  the  discrete  coordinate  version  of  Lagrangian  Mechanics  leads 
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multaneously  for  the  unknown  vectors  q{t)  and  X{t). 
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the  unconstrained  system  equations  read 

M{q)q  +  H{q,q)  =  B{q) 
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subject  to  the  payload  equations  and  the  prescribed  end  conditions  for  a  rest-to-rest 
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Using  that  A  =  Ai  {q,  q)  -  A2  (g)  u,  and  the  lagrange  multiplier  rule  leads  to  the  necessary 
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globally  asymptotically  stable 
cumbersome  to  implement 
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the  control  provides  for  stable  tracking. 

this  control  law  allows  attractors  other  than  the  origin. 
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Abstract 

We  consider  the  simultaneous  slewing  and  vibration 
suppression  control  problem  of  an  idealized  structural 
model  which  has  a  rigid  hub  with  two  cantilevered  flexi¬ 
ble  appendages  and  finite  tip  masses.  The  finite  element 
method(FEM)  is  used  to  obtain  linear  finite  dimen¬ 
sional  equations  of  motion  for  the  model.  In  the  linear 
quadratic  regulator(LQR)  problem,  a  simple  method  is 
introduced  to  provide  a  physically  meaningful  perfor¬ 
mance  index  for  space  structure  models.  This  method 
gives  us  a  mathematically  minor  but  physically  impor¬ 
tant  modification  of  the  usual  energy  type  performance 
index.  A  numerical  procedure  to  solve  a  time-variant 
LOR,  problem  with  inequality  control  constraints  is  pre¬ 
sented  using  the  method  of  particular  solutions. 

Introduction 

The  problem  of  simultaneous  slewing  and  vibration 
suppression  of  large  flexible  space  structures  has  been 
the  focus  of  intense  research^-'*.  Since  Large  Space 
Structures(LSS)  are  mechanically  flexible  systems,  they 
are  most  generally  described  as  hybrid  coordinate  dy¬ 
namical  systems.  Their  motion  is  described  by  a  cou¬ 
pled  sj'stem  of  ordinary  and  partial  differential  equa¬ 
tions.  The  corresponding  nonlinear  integro-differential 
equation  of  motion  are  usually  linearized,  discretized  in 
space,  and  truncated  to  a  finite  number  of  modes.  The 
assumed  mode  method  and  the  FEM  are  widely  used 
for  obtaining  discretized  linear  equation  of  motion  for 
large  flexible  structures. 

Several  approaches  to  associated  control  of  LSS  have 
been  investigated.  The  linear  quadratic  regulator  and 
associated  tracking  problems  have  been  treated  success- 
fullv  and  represent  an  important  class  of  optimal  con¬ 
trol  application®.  In  the  LQR  problem,  the  choice  of 
performance  index  is  very  important  and  problem  de- 

*  Graduate  Student,  Department  of  Aerospace  Engi¬ 
neering.  Student  Member  AIAA. 
t  Eppright  Chair  Professor,  Department  of  Aerospace 
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pendent  task.  Usually  LQR  problems  are  considered 
without  any  bounds  for  states  and  controls.  If  there  are 
inequality  constraints  on  the  controls,  however,  then 
Pontryagin’s  minimum  principle  could  be  applied  to 
find  the  necessary  conditions  for  optimality.  Unfor¬ 
tunately,  the  resulting  equations  from  the  optimality 
conditions  give  us  nonlinear  differential  equations  even 
though  the  original  system  of  equations  is  linear®.  For 
this  reason,  we  can  not  determine  controls  analytically. 
Rather,  we  must  attempt  to  find  the  solutions  by  an 
iterative  numerical  procedure. 

In  this  paper,  we  consider  the  simultaneous  slewing 
and  vibration  suppression  control  problem  of  a  rigid  hub 
with  two  cantilevered  flexible  appendages  which  have  fi¬ 
nite  tip  masses.  The  FEM  is  used  to  obtain  linear  finite 
dimensional  equations  of  motion  for  the  flexible  space 
structure  model.  We  introduce  a  simple  method  which 
provides  a  physically  meaningful  performance  index  for 
space  structure  models.  This  method  gives  us  a  mathe¬ 
matically  minor  but  physically  important  modification 
of  the  usual  energy  type  performance  index.  A  numer¬ 
ical  procedure  to  solve  a  time-variant  LQR  problem 
with  inequality  control  constraints  is  presented  using 
the  method  of  particular  solutions^-®.  We  ako  present 
simulated  results  to  explore  the  utility  of  this  method. 


Finite  Element  Modeling 

Using  the  FEM,  the  partial  differential  equations 
of  the  motion  are  transformed  into  an  approximate 
set  of  second-order  differential  equations  in  terms  of 
the  displacements,  velocities,  and  accelerations  of  the 
finite  element  coordinates,  and  the  external  forcing 
functions.  With  reference  to  Fig.l,  we  consider  a 
rigid  hub  with  two  cantilevered  flexible  appendages 
which  have  finite  tip  masses.  Table  1  summarizes 
the  configuration  parameters  of  this  flexible  structure. 
The  appendage  is  considered  to  be  a  uniform  flexible 
beam  and  we  make  the  Euler-Bernoulli  assumptions  of 
negligible  shear  deformation  and  negligible  distributed 
rotatory  inertia.  The  beam  is  cantilevered  rigidly  to 
the  hub.  Motion  is  restricted  to  the  horizontal  plane 
and  we  neglect  the  velocity  component  -yd,  that  is 
perpendicular  to  the  y  direction.  Several  finite  element 
models  for  a  flexible  arm  are  presented  in  Refs.[9] 
and  [10].  In  this  section,  we  present  a  finite  element 


81 


model  for  the  model  by  using  the  extended  Hamilton  s 
principle  that  provides  a  variational  weak  form  for 
the  finite  element  model.  It  is  significant  to  note 
that  we  introduce  the  finite  element  approximations 
in  such  a  way  (co-rotational  coordinates)  that  large 
hub  rotations  are  admitted;  the  FEM  represents  small 
elastic  displacements  with  respect  to  hub-fixed  axis. 


Fig.l  five-body  hybrid  coordinate  system 


Table  1  Configuration  Parameters 


parameter  symbol 

VALUE 

Hub  radius 

r 

1  ft 

Rotary  inertia  of  hub 

Jh 

8  slug-ft^ 

Mass  density  of  beam 

P 

0.0271875  slug/ft 

Elastic  modulus  of  beam 

E 

0.1584x10^°  Ib/ft* 

Beam  length 

L 

4.0  ft 

Moment  of  inertia  of  beam 

I 

0.4709503x10-^  ft'‘ 

Tip  mass 

Tilt 

0.156941  slug 

Rotary  inertia  of  tip  mass 

Jt 

0.0018  slug-ft^ 

The  application  of  the  extended  Hamilton’s  principle 
yields 

C\8T-SV  ^8Wr.c)di^0 

Jti 

88  —  8y  ^  0  at  t  —  t2 

The  displacement  y{x,t)  can  be  discretized  using  a 
finite  clement  expansion  ' 

t  =  l 

where  transverse  deflection  and 

rotation  at  the  left  (right)  end  of  the  element,  and 
are  the  Hermite  cubic  polynomial  shape  func¬ 
tions,  defined  over  the  local  element,  which  satisfy  the 
conditions  for  admissibility. 


Specifically,  the  following  cubic  functions  are 
adopted  as  the  shape  functions  for  the  t-th  finite 
element'^ 

rl)i  =  l-3xf  +  2xf,  V’2  = 

^3  =  3xJ  -  2xf, 

Xi  =  {x-  Xi)fh 

where  x,  is  the  distance  from  the  root  of  the  appendage 
to  the  left  end  of  the  i-th  finite  element,  and  h  is 
the  length  of  the  finite  element.  These  are  the  most 
commonly  used  shape  functions  for  one-dimensional 
beam  elements. 

As  a  consequence  of  the  space/time  separation  implicit 
in  Eq.(2),  the  acceleration  and  curvature  are  expressed 
as  follows: 


i-l 


(4) 


^  dx 


After  some  algebra,  the  assembled  matrix  differen- 
tial  equation  is  as  follows: 


Jh,  +  2M0e 
2Mu9 


2Meu 

2Muu 


la 


0 

2Ki,u 


(5) 


where  u  is  the  coordinate  which  consists  of  the  trans¬ 
verse  deflections  and  rotations  at  each  node  of  tbe  ap¬ 
pendage,  and  we  assume  symmetric  deformations  of  the 
appendages.  The  matrix  elements  of  Eq.(5)  are  pre¬ 
sented  in  Ref.[13].  The  control  system  is  assumed  to 
generate  a  torque  u  acting  upon  the  hub  and  a  torque 
utip  acting  upon  the  tip  mass. 


LQR  with  Inequality  Control  Constraints 

We  introduce  a  method  to  find  a  physically  mean¬ 
ingful  performance  index.  First  Eq.(5)  can  be  written 
in  a  linear  second  order  matrix  form  as  follows. 


(5) 

Modal  coordinates  are  used  to  design  the  controller. 
To  perform  the  modal  coordinate  transformation  , 


Mtc  +  Kx  — 


1  2 
0  0 

0  2  1 


the  following  open-loop  eigenvalue  problem  should  be 
solved  first 


Kcj).  =  A,M^.  i  =  l,2,--,n 

{^) 

• 

with  the  normalization  equation 

Af  .  —  1  i~l,2, 

(8) 

W^e  introduce  the  modal  matrix 

• 

(9) 

The  general  modal  coordinate  transformation  is 

then 

II 

(10) 

where  vW  is  the  n  x  1  vector  of  modal  coordinates. 

®  The  transformed  equation  of  motion  becomes 

Mfi+  kri=  Du  (11) 


where 


M  =  In 


D  = 


1 

0 


2- 

0 


0  2 


Note  that  diagonal  zero  in  K  corresponds  to  the  rigid 
body  mode.  For  control  applications  the  system  dy¬ 
namics  are  usually  modeled  as  first  order  state  space 
differential  equations.  We  introduce  the  “2n”  dimen¬ 
sional  modal  state  vector  z  =  then  Eq.(ll)  can 

be  written  as  the  first  order  system 


where 

«  =  [  0  «a.l  ■  2K-1 

Note  that  the  usual  energy  type  performance  ^index 
adopts  instead  of  as 

the  upper  left  submatrix  of  Q. 

We  assume  that  the  control  is  constrained  in  mag¬ 
nitude  by  the  relation 

K(01<1  j  =  l,2,--,m  (15) 

Note  that  the  B  matrix  of  Eq.(12)  and  the  R  matrix 
of  Eq.(14)  can  be  defined  to  obsorb  the  normalization 
Uj  to  allow  the  normalized  magnitude  of  wy(t)  to 
have  a  unity  saturation  limit,  without  restriction. 

The  Pontryagin’s  minimum  principle  consists  of  the 
state  and  costate  equations  and  the  optimaUty  condi¬ 
tion  as  follows: 

z'  =  Az"  -I-  B  u" 
p-  =  -Qz*  -  A'^p' 

B(z-,u-,p-,t)  <  B(z-.u,p-,t)  for  ail  admissible  u 

(16) 

where  H  is  the  Hamiltonian  function. 

The  solution  of  the  open-loop  problem  which  rep¬ 
resented  by  Eqs.(12,14,15)  must  satisfy  the  foUowing 
nonlinear  two  point  boundary  value  problem(TPBVP) 
derived  from  Pontryagin’s  minimum  principle®.  The 
detail  proof  of  Eq.(17)  is  in  the  Appendix. 

i-  =  Az*  -  B  SAT(JZ-^B’’p*) 
p”  =  — Qz'  —  A"^p 


z  =  Az  +  Bu  (1^) 


where 


4-[  '"1  . 

B  =  1 

■  0  ' 

• 

■  “  [-K  0  j  ’ 

D 

Now  the  kinetic  energy  and  potential  energy  are  as 
follows:  . 

T=-x^Mk,  V=U'^Kx  (13) 

2  ^ 

Usually  we  include  the  position  feedback  control- 
induced  potential  energy  term  since  we  expect 

the  control  to  drive  0  to  zero.  We  introduce  a  new 
weighting  matrix  Q  in  the  performance  index  J  as 
follows: 


J^  —  f  (aix^  Af  x  +  n2X^  A’x -f  dt 

^  Jo 

—  i  ^  Qz  +  R\x)  dt 

2  7o 


where  p  is  the  costate  vector  and  sai{yi)  is  defined  that 
5at(yi)  =  Vi  if  \yi\  <  1  and  sat(yO  =  sgn{yi)  if 
ly,j  >  1,  and  SAT()  is  a  similar  vector  valued  function. 

When  the  initial  condition  of  z*(t)  and  the  terminal 
condition  of  p’(0  are  assigned  as  z‘(0)  =  Zq  and 
=  hz*{tf),  the  method  of  particular  solutions 
associated  with  a  quasi-Unearization  method  gives  us 
the  open  loop  optimal  solution. 


Method  of  Particular  Solutioiis 

A  general  technique  for  solving  nonlinear  TPBVPs 
was  presented  in  [7,8].  The  method  of  particular  solu¬ 
tions  and  an  associated  quasi-linearization  method  are 
summarized  and  then  applied  to  LQR  problems  with 
inequality  control  constraints. 

First  consider  the  linear  differential  system 

v  =  F{i)v+D{i)  Q<t<tf  (18) 
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with  the  boundary  conditions 

Vt(0)  =  aj  (19) 

Cv{ij)  =  p  (20) 

where  (7  is  a  known  n  x  271-  matrix  and  /3  is  a  known 
constant  vector. 

Let  vJ(t)  (j  =  1,2,-*  *,714*1)  denote  714-1  particular 
solutions  obtained  by  forward  numerical  solution  of 
Eq.(18)  with  the  following  7i  4- 1  sets  of  initial  conditions; 

Vj(0)~ai  7=1,2,  ••♦,71  j=:1,2,---,7i4*1 

<  =  fc=l,2,..-,7i  jz.  1,2,. ••,71+1 

"  (21) 

where  Sjk  is  the  kronecker  delta. 

Due  to  the  linear  property  of  Eq.(18),  we  can  com¬ 
bine  the  71+1  particular  solutions  to  obtain  another 
solution 

(22) 

J=l 

The  unknown  coeflicients  kj^s  are  determined  in 
such  a  fashion  that  the  solution  v(t)  satisfies  the  bound¬ 
ary  conditions  of  Eq.(21).  From  the  initial  and  terminal 
conditions,  we  obtain  the  following  equations. 

n  +  l 

^  fcy  =  1 


of  Eq.(26)  are  satisfied  only  approximately,  then  the 
boundary  conditions  are  as  follows: 


Avi(O)  =  0 


i  =  1,2.  — ,71 


1  ]Av(t/)  = -4'(v„(t/)) 

Then,  Eqs.(28)  and  (29)  constitute  a  linear  differential 
system  and  can  be  solved  by  the  method  of  particular 
solutions.  In  order  to  avoid  numerical  differentiation 
Vn  in  Eq.(28),  we  can  rewrite  Eqs.(28)  and  (29)  using 
V  =  Vn  +  Av  as  follows: 

]v+|/(v„,t)-  j  |v„|  (30) 


with  boundary  conditions 


Vi(0)  =  Oi 


t  =  1,2.- 

.  ra't 


I'p  1 
^  v(t/)  = 


The  solution  v(t)  becomes  a  new  nominal  solution 
V„(t). 

Now,  we  consider  the  nonlinear  TPBVP  of  Eq.(17) 
with  boundary  conditions.  Let 


Equation  (23)  constitutes  7i+  1  equations  which  can 
be  solved  to  determine  the  n  +  1  fcy’s.  The  solution  is 
then  obtained  by  recombining  the  individual  particular 
solutions  according  to  Eq.(22). 

Second,  consider  the  nonlinear  differential  system 

v  =  /(v,t)  0<t<t/  (24) 

with  the  boundary  conditions 

Vi(0)  t  zi:  1,  2,  •  •  • ,  71  (25) 

^(v(ty))  =  0  (26) 

Equation  (24)  is  linearized  about  a  nominal  solution 
v„(t).  The  linearized  equations  are  given  by 

Vn  +  Av  = /(vn,  t)  +  ^  Av  (27) 

v„(t)J 

w+ere  Av  are  corrections  to  the  nominal  solutions. 
Eq.(27)  is  rew'ritten  as  follows 

Av  =  Av+ {/(v„,<)  -  v„}  (28) 

v„(oJ 

If  Vn(t)  is  selected  such  that  the  initial  conditions  of 
Eq.{25)  are  satisfied  exactly  but  the  terminal  conditions 


Vi(0)  =  zox  t-  1,2,  ••♦,71  ,  . 

hh  /n]v(t/)=0 

To  obtain  the  linearized  differential  equation,  we 
need  [|{|v«(t)]  of  Eq.(30).  For  the  case  of  the  LQR 
problem  with  inequality  control  constraints,  [|“lv«(t)] 
can  be  obtained  easily  by  the  following  procedure. 

By  the  presence  of  SAT  function  in  Eq.(17),  first  we 

evaluate  the  TTix  1  vector  .  Ifl(-R  )j|  ^ 

1  for  all;  =  1,2,  •♦.,771,  then  the  nonlinearity  of  Eq-(  17) 
disappears,  so  obviously 

ra/ 1  1  \A 

=  J 

If  there  are  jf’s  such  that  l(R”^B^p*)jj  >  1,  then  we 
define  a  7n  x  n  matrix  Y.  This  matrix  is  basically 
but  each  j-th  row  is  replaced  by  a  zero  row 
vector  when  j  is  the  index  such  that  ((R  ^B^p  )j  \  >  1* 
Then, 

@1 . 1-1-'.  --:n  " 

Substituting  Eq.(33)  into  Eq.(30)  gives  us  a  linearized 
differential  equation. 
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Simulated  Results 


We  consider  the  previous  flexible  structure  with 
reference  to  Fig.l  and  use  the  configuration  parameters 
as  shown  Table  1.  The  discretized  equations  of  motion 
are  presented  in  Eq.(5).  Here  we  use  3  finite  elements 
and  time  interval  (0  <  t  <  1)  with  initial  conditions 
fl(0)  =  0.2  rad  and  y{x,  0)  =  0  for  all  x.  We  use  1  for 
oi  and  02,  100  for  kg,  diag{5,S0)  for  R  of  Eq.(14).  We 
assume  that  the  controls  are  constrained  in  magnitude 

as  follows: 


WOl  ^  ^  0.015 

Figures  2-5  show  0{t),  ytip{i),  “W:  '^tip{t)  for 

both  cases  (constrained  control  case  and  unconstrained 
control  case).  The  first  four  state  and  cosUte  histories 
of  the  constrained  control  case  are  shown  in  Figs.6  and 
7  respectively.  Figure  7  shows  that  the  costates  satisfy 
the  terminal  condition  p'(t/)  =  0- 


Summary  and  Conclusion 

The  present  paper  introduces  a  simple  method  which 
provides  a  physically  meaningful  performance  index  for 
space  structure  models  in  the  LQR  problem.  This 
method  gives  us  a  reasonable  modification  of  the  usual 
energy  type  performance  index.  A  numerical  proce¬ 
dure  is  presented  to  obtain  open  loop  solution  of  the 
time- variant  LQR  problem  with  inequality  control  con¬ 
straints,  using  the  method  of  particular  solutions  incor¬ 
porated  with  a  quasi-linearization  technique.  This  ap¬ 
proach  does  explicitly  consider  control  saturation  con¬ 
straints  and  therefore  represents  a  generalization  of 
the  standard(unbounded)  control  assumptions  for  LQR 
problems.  Numerical  results  are  presented  which  shows 
the  utility  of  the  method,  using  the  idealized  struc¬ 
tural  model  which  has  a  rigid  hub  with  two  flexible 
appendages  and  finite  tip  masses. 
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The  LQR  problem  of  Eqs.(l2, 14,15)  can  be  written 
as  the  nonlinear  TPBVP  of  Eq.(17)  using  the  Pontrya- 
gin’s  minimum  principle. 

Pontryagin*s  minimum  principle: 

i'=Az'  +  Bu  (^1) 

p-  =  -Qz*  -  A^’p*  (A2) 

jr(z*,u',p*.t)  <  /f(z*,u,p*,£)  for  all  admissible  u 

(A3) 

where  H  is  the  Hamiltonian  function. 

From  Eq.(A3)  L{u%  Ru') -f  (Bu-,p')  <  i(u,Ru)-f 

{Cu,  p')  hold  for  all  u  such  that  luy(<)l  <  1  7  = 

l,2,--,m. 

Let  us  define  w*  as  w"  =  R  'R^p*,  then 

i(u',  Ru") -t- (u‘,  Rw")  <  -(u,  Ru) -F  (u,  Rw*) 

2  ^ 

Now  we  add  ,  Rw')  to  both  sides, 

i(u%  Ru')  +  (u‘,  Rw')  -F  i(w',  Rw*) 

<  l{u,  Ru)  -F  (u,  Rw')  -F  — (w*,  Rw  ) 

2  ^ 

((u'-Fw*),R(u--Fw*))  <  ((u-Fw'),R(u-Fw*))  (A4) 

for  all  u  such  that  juj  j  <  1  vrhere  j  —  1,  2,  •  ♦  • ,  m. 
Equation  (A4)  implies  that  uj-  =  -Wj  if  Itu^l  <  1 
—  -sgn{wj}  if  \wj  \  >  1- 

To  prove  above  statement,  we  proceed  as  follows: 
a  =  u  +  w“ 

Equation  (A4)  implies  that  the  function  t/i(u)  =  (a,  Ra) 
attains  its  minimum  at  a*  =  u*  -f  w‘. 

Since  R  is  positive  definite,  the  eigenvalues  of  R  are 
positive  for  all  t. 

Let  D  be  the  diagonal  matrix  of  the  eigenvalues.  D  = 
RP  where  P  is  an  orthogonal  matrix. 

V.(u)  =  (a,Ra)  =  (a.RDP^a) 

=  (P'^a,  DP’’ a)  =  (b,  Db) 

m 

where  b  =  P*^a, 
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Since  P  and  P’’  are  both  orthogonal,  (b.b)  =  (a,  a) 
equivalently 

m 

E‘;=E«i 

7=1  7=1 

Now  we  establish  the  relations 


min'0(u)  =  min(a,  i^a) 
u  « 

m  Tn 

=  min  y'  dj  h]  =  V)  min  6^  (A6) 


7  =  1 


7  =  1 


Equation  (A6)  implies  that  if  a*  minimizes  {a,i2a), 
then  the  components  ^>5,  *  * ' » minimize  the 
scalar  product  (b,b)  where  b  =  P^a. 

In  view  of  Eq.(A5),  we  may  conclude  that  the  vector 
PP^a*  =  a“  minimizes  the  scalar  product  (a, a). 
Therefore,  if  (a*,Pa‘)  <  {a,  Pa)  then  (a%a‘)  <  {a, a). 


We  can  reverse  our  reasoning  as  follows: 

If  {a*,  a*)  <  (a,  a)  then  (a*,  Pa'")  <  {a,  Pa). 

We  know  that 

m 

(a,  a)  =  ((u+  w*),  (u  +  w*))  = 

y=i 

m 

We  can  deduce  that  min(a,  a)  =  Y]  min  (uj  +  wj )^. 

To  minimize  the  positive  quantity  (uj  one  must 

set 

Uj  =  —vjj  whenever  jiuj]  <  1 
Uj  =  +1  whenever  w~  <  —  I 

tij  =  —  1  whenever  >  1  ■ 
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Introduction 

N  the  recent  literature,  an  asymptotic  stability  theorem*  for 
autonomous  and  periodic  nonautonomous  systems  was 
used  to  prove  the  global  asymptotic  stability  of  the  mass¬ 
spring-damper  system  and  the  damped  Mathieu  system.  For 
such  systems,  the  application  of  LaSalle’s  invariant  set  theo¬ 
rem-  has  been  the  conventional  approach  adopted  to  prove  the 
global  asymptotic  stability.  When  the  derivative  of  the  Lya¬ 
punov  function^  vanishes,  LaSalle’s  theorem^  requires  us  to 
show  that  the  maximum  invariant  set  of  the  system  consists 
only  of  the  equilibrium  point  at  its  entry.  Although  it  is  always 
simple  to  identify  the  set  of  points  Q  where  the  derivative  of 
the  Lyapunov  function  vanishes,  the  maximum  invariant  set 
/CQ  is  not  always  easy  to  identify.  The  main  challenge  of 
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for  publication  Jan.  5,  1993.  Copyright  ©  1993  by  R.  Mukherjee  and 
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Astronautics.  Inc.,  with  permission. 
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LaSalle’s  theorem^  is  therefore  to  sort  out  the  maximum  in¬ 
variant  set.  For  a  distributed  parameter  system  the  dynamics 
are  described  by  a  hybrid  set  of  ordinary  and  partial  differen¬ 
tial  equations.  For  such  a  system,  the  sorting  out  of  the  maxi¬ 
mum  invariant  set  is  not  a  trivial  task.  In  such  a  situation  it  is 
useful  to  apply  the  theorem  in  Ref.  1  so  as  to  comment  on  the 
asymptotic  stability  of  the  system. 

The  distributed  parameter  system  consisting  of  a  rigid  hub 
with  one  or  more  cantilevered  flexible  appendages  has  ap¬ 
peared  in  the  technical  literature  quite  frequently  (see  Refs.  4, 
5,  6,  and  7).  The  system  described  in  Fig.  I  consists  of  four 
appendages  that  are  identical  uniform  beams  conforming  to 
the  Euler-Bernoulli  assumptions.  Each  beam  cantilevered 
rigidly  to  the  hub  is  assumed  to  have  a  tip  mass.  The  motion 
of  the  system  is  confined  to  the  horizontal  plane  and  the  con¬ 
trol  torque  is  generated  by  a  single-reaction  wheel  actuator. 
Under  the  assumption  that  the  system  undergoes  antisymmet¬ 
ric  motion  with  deformation  in  unison  (see  Fig.  2),  a  class  of 
rest-to-rest  maneuvers  was  considered  in  Ref.  4.  For  the  partic¬ 
ular  Lyapunov  function  considered,  the  best  choice  of  the 
control  input  only  guaranteed  the  negative  semidefiniteness  of 
the  derivative  of  the  Lyapunov  function.  To  conclude  the 
global  asymptotic  stability  using  LaSalle’s  theorem,  it  would 
be  necessary  to  formally  prove  that  the  maximum  invariant  set 
consists  only  of  the  equilibrium  point.  The  global  asymptotic 
stability  of  the  system  was  claimed  in  Ref.  4  in  the  absence  of 
this  proof. 

In  this  Note  we  consider  the  hub-appendage  problem"*  with 
modifications.  The  modeling  and  successful  control  of  such  a 
system  is  expected  to  provide  us  with  insight  into  the  modeling 
and  control  of  a  general  class  of  distributed  parameter  sys¬ 
tems.  Using  a  Lyapunov  function  approach  and  the  asymp¬ 
totic  stability  theorem  in  Ref.  I,  we  prove  that  global  asymp¬ 
totic  stability  of  the  system  is  guaranteed  provided  the  system 
undergoes  antisymmetric  motion  with  deformation  in  unison. 
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In  Other  situations,  such  as  symmetric  motion  with  deforma¬ 
tion  in  opposition  (see  Fig.  2),  such  a  conclusion  cannot  be 
drawn. 

Theorem  on  Asymptotic  Stability 

Consider  the  nonautonomous  system 

(1) 

where /:/?^  x£>  — /?"  is  a  smooth  vector  field  on  D 

CR"in  the  neighborhood  of  the  origin  x  =  0.  Let  x  =  0  be  an 
equilibrium  point  for  the  system  described  by  Eq.  (1).  We  now 
state  the  theorem  on  asymptotic  stability.* 

Theorem.  1)  A  necessary  condition  for  stable  nonau¬ 
tonomous  systems:  Let  F(/,x) :  /?+  xD--R^  be  locally  posi¬ 
tive  definite  and  analytic  on  R^xD^  such  that 

is  locally  negative  semidefinite.  Then  whenever  an  odd  deriv¬ 
ative  of  y  vanishes,  the  next  derivative  necessarily  vanishes 
and  the  second  next  derivative  is  necessarily  negative  semi¬ 
definite.  2)  A  sufficient  condition  for  asymptotically  stable 
autonomous  systems:  Let  K{x):D— be  locally  positive 
definite  and  analytic  on  D,  such  that  V<0.  If  there  e.xists  a 
positive  integer  k  such  that 

I/<-^'‘*(x)<0  Vx7^0:F(x)  =  0 

=0  for  2k 

where  Kt**(x)  denotes  the  (*)th  time  derivative  of  V  with  re¬ 
spect  to  time,  then  the  system  is  asymptotically  stable.  How¬ 
ever,  if  K^^*(x)  =  0,  y/=  1,  2,..., 00,  then  the  sufficient  condi¬ 
tion  for  the  autonomous  system  to  be  asymptotically  stable  is 
that  the  set 

S  =  {x:  K<-''(x)  =  0.  vy  =  l,2 . ooj 

contains  only  the  trivial  trajectory  x  =  0. 

Hub-Appendage  Problem 

This  example  is  taken  from  Ref.  4  with  some  modifications. 
The  hybrid  system  of  ordinary  and  panial  differential  equa¬ 
tions  governing  the  dynamics  of  the  system,  which  has  already 
been  described  in  the  introduction,  is 


1 


Fig.  1  Distributed  parameter  autonomous  system  consisting  of  a 
rigid  hub  with  four  cantilevered  flexible  appendages. 


d20  " 

^hub  ,  5  —  U  2.^  (-^^iO  ““  rS/o) 


(3) 


“  (A^/o  ~  •^/o) 


dx  +  m/ 


dt^ 


,) 


i  =  1,  2,  3,  4 

(4) 

(ar*  toV 

aVi 

l  +  £/^=0. 
dx* 

/  =  i,  2,  3,  4 

(5) 

Fig.  2  Antisymmetric  and  symmetric  motion  of  the  system  consist¬ 
ing  of  a  rigid  hub  and  four  flexible  appendages:  A  is  the  antisymmetric 
motion  (deformation  in  unison),  y'i(x,  r)  =  3?2(x,  r)=y3(x,  t)=yA{x, 
r)  and  B  is  the  symmetric  motion  (deformation  in  opposition),  yi(x,  r) 
-  -yi{x,t),  yiix,  r)=  -y4(x,  t). 


The  state  of  the  system  is  described  by  a  hybrid  set  of  dis¬ 
crete  and  continuous  variables: 


Z  = 


6,  e,yi(x,i),...,y4{x,t). 


dydx,  0 

bt 


^y^(x,  t) 
dt 


(9) 


The  boundary  conditions  on  Eqs.  (3-5)  are 
dyi 


yi{t,r)  = 

d^y, 

dx^ 


dx 

=  0, 


dx^ 


,  £/V  d/^  /A 


=  0,  /  =  1,2,  3,  4  (6) 

/  =  1,2,  3,  4  (7) 

1  =  1,2,  3,  4  (8) 


We  choose  the  Lyapunov  function  as 

K  .  f  f  («-.,)■  +  f  E  [  .rf)'  to 

to  derive  control  laws  that  will  drive  the  system  to  its  desired 
state  Zd«ircd  =  (0/.  0.  0 . 0,  0,...,0).  In  Eq.  (10),  a,,  Oi,  and 
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Oi  are  positive  constants.  It  can  be  shown^  that  the  choice  of 
u{()  as 


4 

^2^  +  ^4^  +  (^3  ^l)  E  ^io) 

1=  I 


<74>0  (11) 


in  Eq.  (3)»  leads  to  K=  -046'.  Clearly,  Vis  negative  semidef- 
inite  and  is  equal  to  zero  if  0  =  0.  To  check  for  the  asymptotic 
stability  of  the  system  using  the  theorem  in  Ref.  1,  we  first 
compute  the  higher-order  derivatives  of  V.  We  find  that  when 
y=0,  the  following  always  holds 

=  /  =1.2 . 2k  (12) 

for  some  positive  integer  In  Eq.  (12),  denotes  the  (♦)th 
time  derivative  of  K,  and  denotes  the  (♦)th  time  derivative 
of  6.  Using  Eq.  (12)  and  the  sufficient  conditions  of  the  asymp¬ 
totic  stability  theorem, •  we  conclude  that  the  system  is  globally 
asymptotically  stable  if  for  any  positive  integer  k.  In 

other  words,  if  ^=0  at  some  time  /  =  T,  then  the  system  will 
be  globally  asymptotically  stable  if  6  is  not  a  constant  for  all 
/  >  7",  and  is  a  constant  only  qt  the  equilibrium  point. 

We  now  investigate  the  case  where  ^  is  a  constant  at  a  point 
other  than  at  the  equilibrium  point  where  Z  Let  this 
constant  be  6c^  Then  Eqs.  (3-5)  simplify  to 

u-L  (''5,0-Mo)  =  0  (13) 

/=i 


d’Vi  an- 

-  (Mo  -  =  px  — ^  dx  +  ml 

r  d/"  df’ 


d!- 


/■  =  L  2,  3,  4 


»  /  =  I,2,  3,  4 

(14) 

(15) 


The  boundary  conditions  given  by  Eqs.  (6)  and  (7)  remain 
unchanged,  but  the  boundary  condition  given  by  Eq.  (8)  sim¬ 
plifies  to 


Because  Yix,  t)=^ Fix)G{i),  and  C(/)  is  a  constant,  Eqs.  (20) 
and  (21)  imply 


d^Y 

dx^ 

=  0  => 

a>Y 

— :  =  const 

ax^ 

(22) 

d^Y 

dx^ 

=  0 

/ 

(23) 

From  Eqs.  (22)  and  (23)  it  follows  that  {d^Y/dx^)  =  0,  which 
implies  that  {d-Y/dx^)  is  a  constant.  Additionally,  the  value 
of  this  constant  can  be  shown  to  be  zero  from  the  boundary 
condition  in  Eq.  (7).  Proceeding  in  the  same  way  and  using 
the  boundary  conditions  in  Eq.  (6),  it  is  trivial  to  show  that 
{dY/dx)=  K(x, /)  =  0.  This  implies  from  Eqs.  (18)  and  (17) 
that  z/=0  and  6c  =  0f.  Clearly,  the  maximum  invariant  set 
for  the  system  comprises  the  set  of  points  where  6  =  6/, 

6  =  0,  and  D?=i>'/(a%  r)  =  0.  If  there  exist  functions >'i(A,  t)7^0, 

/  =  1,  2,  3,  4  such  that  T  =  =  0  holds,  then  the  set 

S  =  \Z  :  y^^'\Z)  =  0,  vy  =  l,  2,...,oo|  contains  entries  other 
than  the  trivial  solution  Z  =  Zdesircd-  In  such  a  situation  we 
cannot  claim  global  asymptotic  stability  of  the  equilibrium 
point.  Such  a  situation  may  arise  in  the  case  of  symmetric 
deformation  in  opposition,  shown  in  Fig.  2,  where  yi(jr, /) 
=  -  V2(.v,  0  and  ;^3(a',  /)=  -y4(A',  /).  In  such  a  situation,  the 
residual  energy  of  the  system  remains  trapped  within  the 
beams.  There  exists  no  net  interacting  moment  between  the 
hub  and  the  beams,  and  the  hub  remains  motionless  at  its 
desired  configuration  6  =  6/. 

The  case  of  antisymmetric  deformation  in  unison,  shown  in 
Fig.  2,  was  considered  in  Ref.  4.  In  this  case,  it  is  assumed  that 
.y,0v, /)  =  V:(a',  0=3'3(a',  0=3*4(a', /).  When  Yix,t)  =  0,  this 
implies  that  y,{x,  0  =  0  for  /  =  1,  2,  3,  4.  Therefore,  for  anti¬ 
symmetric  deformation  in  unison,  it  is  quite  simple  to  show 
that  the  set  S  =  |Z  :  K<>>(Z)  =  0,  vy  =  1,  2,...,oo|  contains 
only  the  equilibrium  point  Z  =Zd«ired-  Consequently,  we  can 
establish  the  asymptotic  stability  property  of  the  hub  with  the 
flexible  appendages  undergoing  antisymmetric  deformation  in 
unison  under  the  input  defined  by  Eq.  (1 1).  The  control  law 
given  in  Eq.  (11)  was  used  to  stabilize  the  system  to  the  equi¬ 
librium  point  in  Ref.  4,  but  no  formal  proof  for  the  asymptotic 
stability  was  provided. 


d^yj  _  ^ 

dx^  ,  ~  El  dt^ 


/  =  1,  2,  3,  4  (16) 


Also,  the  input  to  the  system  u{t)  defined  by  Eq.  (11)  can  be 
simplified,  using  Eq.  (13),  to 


U  =  t  (rSio-Mio)  =  ^  (0,  -  fie)  =  C  =  const  (17) 

i  =  l  0} 


If  we  define  T  =  2  (I^)  implies 

1  =  1 


Conclusion 

The  rest-to-rest  maneuver  of  the  distributed  parameter  sys¬ 
tem  consisting  of  a  rigid  hub  with  four  cantilevered  flexible 
appendages  was  studied.  The  best  choice  of  the  control  input 
resulted  in  the  negative  semidefiniteness  of  the  derivative  of 
the  Lyapunov  function.  An  invariant  set  analysis  of  the  system 
was  subsequently  carried  out  using  an  asymptotic  stability  the¬ 
orem.*  The  analysis  establishes  the  fact  that  the  hub-ap¬ 
pendage  system  is  globally  asymptotically  stable  when  the  sys¬ 
tem  undergoes  antisymmetric  motion  with  deformation  in 
unison. 


ax’ "  dx\ 


=  |=const 


(i8) 


If  we  make  the  reasonable  assumption  that  T(x,  /)  is  of  the 
form  Yix,  /)  =  F(a)0(/),  then  Eq.  (18)  leads  to 


(19) 


Equation  (19)  implies  that  C(r)  is  a  constant.  Summing  Eqs. 
(15)  and  (16)  over  /  =  1  to  /  =  4,  we  have 


P 


at^  dx*  ~  ° 


(20) 


ajc» ,  ~  El  at-  I 


(21) 
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NEAR-MINIMUM-TIME  THREE-DIMENSIONAL  MANEUVERS 
OF  RIGID  AND  FLEXIBLE  SPACECRAFT 


Mark  J.  BelP  and  John  L.  Junkins^ 


An  approach  is  presented  to  accomplish  large  angle,  nonlinear,  three  dimensional 
attitude  maneuvers  in  either  near-minimum-time  or  near-minimum-fuel.  The 
method  permits  the  specification  of  a  torque  shaped  reference  maneuver  of  the 
near-minimum-time  (bang-bang)  or  near-minimum- fuel  (bang-off-bang)  type;  the 
instantaneous  switches  are  replaced  by  controllably  sharp  spline  switches  to  reduce 
excitation  of  flexible  degrees  of  freedom.  A  Lyapunov  method  is  used  to  design 
tracking-type  control  perturbations  to  suppress  errors  due  to  disturbances  and 
model  errors.  The  method  is  illustrated  by  numerical  simulations  and  some 
experimental  results  using  the  ASTREX  test  article. 

INTRODUCTION 

Primarily  due  to  mass  considerations,  future  spacecraft  will  most  likely  have 
large .  flexible  appendages  and  exhibit  significant  coupling  between  overall  rigid 
body  motion  and  vibratory  motion.  Many  of  these  spacecraft  will  be  required 
to  perform  a  variety  of  maneuvers  in  three-dimensions  in  near-minimum-time,  or 
near-minimum-fuel,  with  limited  computational  abilities,  while  suppressing  flexible 
modes  of  vibration.  A  torque-shaped  reference  maneuver  design,  augmented  by  a 
Lyapunov  stable  tracing  law  can  achieve  these  stated  requirements  with  robustness 
in  the  presence  of  uncertainty. 

The  main  goal  of  this  paper  is  to  demonstrate  one  effective  approach  to  control 
a  flexible  spacecraft  in  near- minimum- time  in  three  dimensions  while  actively 
and  passively  suppressing  flexible  modes  of  vibration.  Secondarily,  an  an^ogous 
development  for  the  the  near-minimum-fuel  case  are  presented.  Feasibility  of 
this  approach  is  discussed  based  upon  analysis,  computer  simulation  using  both 
a  rigid-body  and  a  flexible-body  simulator,  and  through  results  from  laboratory 
experimentation.  The  experimental  portion  of  this  research  was  performed  on  the 
Advanced  Space  Structure  Technology  Research  Experiment  (ASTREX)  test  article 
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located  in  the  Phillips  Laboratory,  Edwards  Air  Force  Base,  California.  This  study 
was  undertaken  as  a  part  of  the  NASA/DOD  Guest  Investigator  program. 

The  basic  concepts  underlying  modern  spacecraft  dynamics  and  control  have 
been  treated  by  many  authors,  including  Junkins  and  Turner.'  Single-axis  control 
of  flexible  spacecraft  has  been  studied^-^  and  the  optimal  control  problem  in  three- 
dimensions  has  been  addressed  by  Vadali,  Singh,  and  Carter.^’®  Near-minimum¬ 
time  control  of  dynamic  systems,  which  include  single-axis  maneuvers  of  flexible 
spacecraft  and  flexible  manipulators,  have  also  been  studied.®-'^  The  purpose  of 
this  paper  is  to  present  a  general  three-dimensional  approach,  leading  to  maneuver 
laws  for  the  ASTREX  structure.  General  model  information,  as  well  as  a  rigid  body 
model  and  a  flexible  body  model  for  the  ASTREX  test  article,  are  available.'® ’'4 
A  near-minimum-time  approach  is  formulated  to  control  the  ASTREX  orientation 
while  vibration  is  attenuated  using  input  smoothing".  Additionally,  effects  of  model 
errors  and  disturbances  are  compensated  using  an  asymptotically  stable  feedback 
controller  based  on  the  work  by  Junkins  et  al",  Wie  et  al'®,  Vadali'®,  and  Junkins 
and  Kim'^. 

EQUATIONS  OF  MOTION 

The  rigid  body  dynamics  are  modeled  using  Euler’s  equations  for  a  rigid  body. 
The  matrix  [I]  is  the  inertia  matrix,  u  is  the  angular  velocity  vector,  [a;]  is  the  matrix 
representation  of  the  standard  cross-product,  and  [B]  is  the  control  influence  matrix, 
each  of  which  has  dimension  3x3. 

[I]<k+p][lU=[B]u  (1) 

The  control  input  to  this  equation  consists  of  a  reference  control,  and  a 
tracking  control  or  terminal  control,  6u,  as  shown  below. 

M  =  Mre/  + 

The  kinematic  equations  used  in  the  spacecraft  model,  equations  (3)  and  (4), 
are  the  set  of  1-2-3  Euler  angles  which  were  used  to  determine  the  body’s  position 
in  space  relative  to  a  fixed  coordinate  system. 

r  Cos{6z)  -Sin{6z)  0 

(^1  =  7^ — TTT  Cos{92)Sin{0z)  Cos{62)Cos{9z)  0  {w}  (3) 

I  J  Cos{e2)  [_5in{02)Gos(03)  Sin{92)Sin{9z)  Cos(02). 

'  Cos{92)Cos{9z)  Sin{9z)  O] 

{w}=  -Cos{92)Sin{9z)  Cos{9z)  0  m  =  [^(0)]  |£|  (4) 

Sin{92)  0  1_ 

These  equations  are  used  to  orient  the  rigid  body  relative  to  a  fixed  inertial 
frame. 
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THE  CONTROL  LAWS 
Near-Minimum-Time  Maneuvers 

The  simplest  minimum-time  maneuver  for  a  near-rigid  vehicle  undergoing  a 
single-axis  maneuver  is  a  single  switch  “bang-bang”  control  law.  However,  the 
sharp  switching  will  excite  some  flexible  modes  of  vibration.  The  near-minimum- 
time  maneuver  proposed  rounds  off  the  sharp  switches  by  replacing  the  sharp 
discontinuities  with  a  controllably  sharp  cubic  polynomial  and  introducing  a  shaping 
parameter  a  :  0  <  a  <  0.25,  where  at  a  =  0,  the  torque  profile  is  a  square  wave  and 
at  a  =  0.25,  the  torque  profile  is  a  smooth  sine-shaped  profile  satisfying  zero  initial 
and  final  slope  conditions.  It  should  be  noted  that  as  a  increases,  the  maneuver 
time  (tf)  increases,  and  the  vibrational  energy  is  expected  to  decrease  due  to  the 
greatly  increased  rolloff  in  the  spectral  content  of  the  control  input.  The  cubic 
polynomial,  defined  as  the  shaping  function  /(t,a,t/)  is  defined^^  as  follows. 


1 


for  0  <  t  <  At  =  atf 
for  At  <  t  <  t//2  -  At  =  ti 


/(t,a,t/)  =  {1-  [3  -  2  (^)]  for  ti  <  t  <  t//2  -f  At  =  tj 

_1  fort2  <t<t/-At  =  t3 


(11) 


The  basic  idea  underlying  this  torque-shaping  approach  is  to  establish  a  smooth 
rigid  body  reference  maneuver,  £,,gy(t),  then  calculate  the  corresponding  open  loop 
control  law  by  inverse  dynamics.  This  reference  torque,  when  applied  to  the  body, 
will  make  0t)  approximate  0,,g^(t).  The  Lyapunov  tracking  law,  which  is  discussed 

in  the  next  section,  seeks  to  cause  £(t)  to  track  in  the  presence  of  disturbances 

and  other  non-ideal  effects  while  also  suppressing  structural  vibrations.  As  will  be 
evident,  it  is  possible  to  develop  the  tracking  law  to  guarantee  asymptotic  stability 
in  the  absence  of  model  errors.  The  development  of  the  open  loop  control  law  is 
shown  below,  beginning  with  the  standard  linear  second  order  equation  of  motion 
for  a  rigid  body: 


m=[B]u 

=  [B]u^axmoc^^f) 


(6) 

(7) 


This  equation  can  be  applied  to  the  reference  maneuver,  manipulated,  and  then 
integrated  twice,  yielding  6 ref-,  Kef-,  and  6 ref  as  shown. 


ie/(t)  =  [7]-'  [B]  Uma.  /(«. 

tef  it)=L  +  [7]"^  [B]  Umax  ^  V) 

e^ef  it)=L+Lt  +  [7]"'  [B]  Umax  ^  ^  /(^i  */) 


«6 
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However,  the  shaping  function,  /(t, t/))  can  be  integrated  twice,  piecewise, 
with  the  elegant  generalization  of  the  bang-bang  (a  =  0)result: 


Substituting  this  result  into  the  previous  equation  and  considering  a  rest-to-rest 
maneuver  (£(to)  =  £(tf)  =  o)  yields  the  following  expression  for  9^  -  9^: 

£/-«<,  =  W-'  |B|  t/^  (i  -  5  “  +  ^  (12) 


This  equation  can  then  be  inverted  to  solve  for  the  required  maneuver  time 
on  each  axis,  as  a  function  of  the  maneuver  angle  change,  shaping  parameter,  and 
maximum  torques  as: 


^  ^/2  j 

>  = 

The  total  maneuver  time,  t/,  is  then  simply: 


tf  = 


(13) 


(14) 


The  effect  of  increasing  alpha  on  a  normalized  maneuver  time  and  the  resulting 
profiles  are  shown  below  as  Figure  1.  As  expected,  the  maneuver  time  increases  as 
a  increases,  as  illustrated  in  the  figure. 


Figure  1.  -  Bang-Bang  Shaping  Function  vs.  Normalized 
Time  for  Increasing  a. 
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n 


The  total  maneuver  time,  t/,  can  be  substituted  into  the  previous  equatmn 
and  a  vector  of  constants  containing  the  maximum  torques,  which  will  be  applied 
on  each  axis,  ur,  can  be  determined. 


(15) 


The  values  of  Ur,  tf^  and  a  selected  value  of  a,  can  then  be  inserted  into 
equations  (8)-(10),  yielding: 

ie/(t)  =  [I\~^  [B]URf{t,OC,tf)  (16) 

J  0 

irc!(t)  =  l,+Lt+[ir'[B]nRj^J^nv,o:,t,)d^dT  (18) 


Now,  using  the  exact  rigid  body  dynamics,  we  can  solve  for  a  control  «re/(<) 
which  would  cause  the  rigid  body  vehicle  to  execute  the  maneuver  ^re  fit).  First, 
the  kinematic  equations  for  the  set  of  1-2-3  Euler  angles,  shown  in  matrix  form  as 
equation  (4),  can  be  used  and  then  differentiated  to  determine  i^refi^) 

a„/(<)  =  [C  («../)]  Lf 

m  =  I  [C  (£„,)]  Lf  +  [C  (2™/)]  i./  (20) 

The  reference  torque,  ttre/(l)’  found  by  inverse  dynamics,  using 

Euler’s  equation. 

1 

=  [B]  ^  ([^l^kre/  l^re/j  [^filref)  (^1^ 

Hence,  the  near-miiiimum-time  torque-shaped  maneuver  has  been  extended  to 

the  three  dimensional  case.  , 

Motivated  by  the  need  to  consider  a  wider  class  of  reference  maneuvers,  such  ^ 

near-minimuin-fuel,  it  was  noted  that  any  function  which  is  twice  integrable  may  in 
principle  be  used  as  the  shaping  function.  Seeking  to  establish  a  torque-shaped 
family  of  near-niiiiimum-fuel  maneuvers,  we  consider  the  bang-off-bang  control 
parameterization  shown  below  as  equation  (22),  where  tz  denotes  the  time  at  the 
end  of  the  first  pulse,  (3  corresponds  to  the  coast  time,  and  o:  parameterizes  the 


i 


i 


i 


98 


sharpness  of  the  of  the  control  on/off  profile. 


g{t,a,l3,t3)=  < 


0 


(^)^[3-2(^)] 


-1 


for  0  <  t  <  <1  =  a2at3 
for  ti  <  t  <  t2  =  (1  ~  2a)t3 
for  t2<t<t3=t3 
for  tj  <  t  <  <4  =  t3  +  /3 
for  t4  <  t  <  ts  =  ti  + 13  +  ^ 
for  h  <t<t6  =  t2+t3+l3 


(22) 


[3-2(^)]  fori6<t<t/=2(a  +  /3 


Following  the  same  procedure  yields  an  alternative  torque  shaped  control  law. 
Figure  2  shows  the  effect  of  increasing  alpha  from  0  to  0.25  on  the  normalized 
maneuver  time  while  holding  constant  at  1.  This  figure  shows  that  the  maneuver 
time  increases  as  the  control  profile  becomes  smoother. 


Figure  2.  -  Bang-Off-Bang  Shaping  Function  vs.  Normalized 

Time  for  Increasing  a. 


The  effect  of  decreasing  beta,  while  maintaining  a  constant  value  for  a  of  0.25 
on  the  maneuver  time,  is  shown  in  Figure  3.  Again,  the  maneuver  time  increases 
as  the  coast  time  is  increased,  as  seen  in  the  figure. 


Figure  3.  -  Bang-OfF-Bang  Shaping  Function  vs.  Normalized 

Time  for  Decreasing 

Two  open-loop  control  laws  have  been  developed  in  this  section;  computational 
and  laboratory  experimental  results  are  discussed  below.  These  open-loop  control 
laws  are  established  for  a  general  rigid  body  that  moves  in  three-dimensional  space, 
although  it  is  recognized  that  these  reference  maneuvers  may  be  significantly  sub- 
optimal  for  the  case  when  the  gyroscopic  coupling  effects  are  large.  Application 
of  this  torque-shaping  scheme  to  any  rigid  body  requires  a  priori  knowledge  of  the 
inertia  matrix  and  the  control  influence  matrix,  both  of  which  are  required  to  be 
invertible.  As  is  evident  in  the  robustness  studies,  however,  including  a  well  designed 
tracking  law  to  compensate  for  larger-than-expected  errors. 

The  open-loop  control  laws  presented  in  this  section  are  exact  solutions  based 
on,  an  inverse  dynamics  approach.  Although  the  control  laws  are  expected  to 
perform  well,  a  closed-loop  feedback  control  law  will  almost  always  be  needed 
to  compensate  for  approximation  errors  as  well  as  disturbances  and  identification 
errors. 

A  Lyapunov  Tracking  Controller 

The  tracking  controller  is  a  Lyapunov  tracking  controller  which  uses  a  different 
parameterization  of  the  positional  error  energy  term.  The  Euler  parameters,  are 
used  to  relate  the  actual  frame  to  the  reference  frame  of  the  body;  note  (  is  often 
known  as  the  “error  quaternion” .  Hence,  when  these  two  frames  coincide,  the  Euler 
parameters  will  be  identically  C  =  [1000]^.  The  Lyapunov  function  and  its  first 
derivative  is  shown  below. 

2V  =  {/]  Scj  d-  [W]  C  (23) 

V  =  6uF[I]S^  +  ^[W]C  (24) 

Through  manipulations  to  follow,  the  time  derivative  of  V  in  Equation  (24)  can 
be  re-arranged  to  form  V  —  5cii{fnct(5u,C,w)},  and  this  structure  can  be  exploited 
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to  determine  a  control  law  for  6u  which  guarantees  K  <  0.  Calculating  the  Euler 
parameters  from  the  1-2-3  set  of  Euler  angles  of  the  actual  frame  and  the  reference 
frame  is  a  straightforward  process.  The  orthonormal  rotation  matrix  from  the 
inertially  fixed  frame  to  the  actual  frame  is  shown  below.  It  should  be  noted  that 
Si  and  Ci  stand  for  sin{di)  and  cos{di),  respectively. 


[T(0)]  = 


C2C3 

-C2S3 

S2 


S3CI  +  C3S251 
C3C1  —  53S2S1 
-C2S1 


Si 53  -  C3S2CI 

S1C3  —  S3S2C1 
C2C1 


(25) 


Additionally,  the  rotation  from  the  fixed  frame  to  the  reference  frame  is 
identical  in  format  with  the  exception  that  Si  and  Ci  stand  for  sin(0re/.)  and 
cos{0refi),  respectively.  The  rotation  matrix  between  these  two  frames  can  be  found 
easily  using  linear  algebra,  noting  the  fact  that  the  inverse  of  an  orthonormal  matrix 
is  its  transpose.  The  rotation  from  the  fixed  frame,  whose  orthogonal  unit  vectors 

are  denoted  by  n,  to  the  body  reference  frame,  kref^  actual  frame,  S,  are 

shown  below. 

S  =  [T(0)]n  (26) 

l,f  =  [T{e,efU  (27) 

The  second  of  these  equations  can  then  be  inverted  yielding  an  expression  for 
projecting  the  fixed  frame  unit  vectors  onto  the  reference  frame. 

n  =  TO,e/)]"’ie/  (28) 


This  equation  can  then  be  substituted  into  equation  (26),  yielding  the  desired 
relationship  between  the  actual  frame  and  the  reference  frame. 

b=[Tm[T{0ref)flef  (29) 

The  error  rotation  matrix  between  the  two  frames  is  then  defined  as  [R]. 


[R]  =  [Tm[T{Bref)f  “  (80) 

We  note  that  [R]  is  typically  a  near-identity  matrix  because  it  represents  the 
tracking  error  angular  displacement  of  S  from  href'  [7^]  been  computed, 

the  set  of  Euler  parameters  between  these  two  frames  can  be  computed  as  follows: 

trace{R)  =  Rn  -1-  R22  +  R33  (81) 


Co  =  ^1^(1  +  trace{R))\ 


(32) 
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Cl  =  77- (^23  -  -^32) 
4Co 

C2  =  -  Rn) 

Cj  = 


(33) 

(34) 

(35) 


This  set  of  Euler  parameters  is  governed  by  the  following  matrix  differential 
equation: 

-Cl  — C2  — Cs  " 

Co  -C3  C2 
C3  Co  -Cl 
L— C2  Cl  Co  -I 


c  = 


Su 


(36) 


c  =  m)] 

Taking  the  transpose  of  this  equation  yields: 

C’’  =  ia;’'lG(C)l’’ 


(37) 


(38) 


By  utilizing  this  result  and  Euler’s  equation  (1)  to  eliminate  [I]6u,  equation 
(24),  the  derivative  of  the  Lyapunov  function  can  be  arranged  in  the  desired  form. 
This  will  permit  construction  of  a  stabilizing  feedback  control  law. 


V  =  601  (-[w][/]w  +  [5re/][/]Wre/  +  +  [^(C)]^  [^]  C)  (39) 

=  —6u^[K\6ui  (^^9) 

The  second  step,  Equation  (40),  is  motivated  by  the  desire  that  6u  be  chosen 
such  that  7  <  0.  Equating  the  right  hand  sides  of  the  previous  two  equations  yields 
an  intermediate  algebraic  equation: 

-\K]5u  =  -[ui][/]w  +  [Sre/][/]Wre/  +  1^5]%  +  [G{Of  [W]  C  (41) 

Solving  for  the  feedback  control  6u  from  equation  (41)  yields  the  asymptotically 
stable  feedback  control  law: 

6U  =  [B]-'  {-[K]6U  +  [U][I]CV  -  PreAmref  "  [<^(0]""  [1^1  C)  (42) 

This  perturbation  is  superimposed  on  the  reference  control  in  the  sense  2k{t)  = 
Mre/(*)  +  M*)-  The  gains  [K]  and  [W]  were  selected  subject  to  the  eigenvalue 
placement  constraint  that  they  produce  critical  damping  on  the  linearized  second 
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order  linear  model  for  rigid  body  motion.  In  addition,  a  scaled  inertia  matrix 
used  as  the  gain  matrix  [K],  since  this  provides  a  one-parameter  family  of  symmetric 
and  positive-definite  gain  matrices.  It  should  be  noted  that  the  matrix  [W],  as  shown 
below,  is  not  positive-definite.  However,  if  the  last  three  terms  in  the  relative  Euler 
parameter  set  C  are  zero,  then  perfect  tracking  is  accomplished  (i.e.  the  set  of  Euler 
parameters  is  redundant).  Hence,  the  fact  that  [W]  is  semi-positive  definite  is  not  a 
problem  due  to  the  redundancy  of  the  Euler  parameters.  The  gains  used  throughout 
this  paper  are  shown  below.  More  generally,  the  gain  matrices  would  be  subject  to 
optimization  over  the  set  of  stable  gains  to  extremize  a  performance  measure,  along 
the  lines  of  Junkins  and  Bang^®. 


[K]  =  Cl  [I\-,  Cl  =  2.5298  and  [W]  =  C2 


0 

0 


0^- 


C2  =  1.6 


EXPERIMENTAL  RESULTS 
Bang-Bang  Experimental  Results 


The  Advanced  Space  Structure  Technology  Research  Experiment  (ASTREX) 
test  article  is  a  large  experimental  structure  that  resembles  a.  spaced-based  laser 
beam  expander  as  shown  in  Figure  4.  The  5000  kilogram  structure  is  mounted  on  a 
spherical  air  bearing  and  is  maneuvered  using  a  specified  set  of  cold  gas  thrusters. 
A  set  of  six  8-pound  thrusters  or  a  set  of  four  200-pound  thrusters  plus  two  8-pound 
thrusters  are  available  for  controlling  the  structure.  For  each  set,  two  thrusters  fire 
in  unison  to  produce  torque.  Hence,  three  sets  of  two  thrusters  firing  in  unison  are 
needed  to  control  the  test  article  in  three  dimensions.  All  thrusters  are  powered 
by  compressed  air  which  is  stored  in  two  pressurized  tanks.  These  pressure  tanfe 
have  a  limited  supply  of  compressed  gas  which  results  in  a  fuel  constraint.  To  avoid 
difficulties  with  the  fuel  constraint,  only  the  bang-off-bang  control  law  is  used  in 
conjunction  with  the  200-pound  thruster  set. 

The  first  set  of  experimental  results  was  tested  using  the  set  of  8-pound 
thrusters  operating  at  a  maximum  thrust  of  3  pounds  in  conjunction  mth  the 
open-loop  bang-bang  control  profile.  The  inertia  matrix  and  the  control  influence 
matrix  for  the  structure  were  given  in  reference  14  and  were  found  by  using  a 
system  identification  technique.  Due  to  the  fuel  constraint  and  to  a  nonlinear  valve 
problem  associated  with  low  tank  pressure,  the  maximum  thrust  from  each  thruster 
was  limited  to  three  pounds  .  The  open-loop  reference  profile  used  on  the  first  test 
is  a  fifty-degree  yaw  maneuver. 
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GImbal  angl«#;  Roll,  Pitch.  Yaw  dagraaa 


Figure  5  -  Bang-Bang  Open-Loop  Experimental  Angle  Profile 
on  the  ASTREX  Test  Article 

During  experimentation,  the  thruster  commands  were  given  in  volts,  which  were 
measured  and  stored  as  input,  and  a  pressure  feedback  on  each  individual  thruster 
was  used  to  determine  the  output  force  at  each  thruster.  Additionally,  three  gimbal 
angles  and  tank  pressure  were  also  sensed  and  stored  as  output.  Figure  5  shows 
the  gimbal  angles  in  the  body  frame  with  respect  to  time  in  the  form  of  three  strip 

^  °  This  figure  shows  that  the  test  article  moved  approximately  forty-two  degrees 
in  the  yaw  direction.  This  is  eight  degrees  short  of  the  specified  maneuver.  The 
rotation  in  the  roll  direction  is  oscillatory,  but  small.  This  small  discrepancy  could 
have  been  caused  by  any  unmodeled,  unsymmetric  mass  in  the  model  or  by  a 
thruster  pair  generating  slightly  different  forces,  or  due  to  unmodeled  suspension 
system  dynamics.  The  pitch  angle  encoder  appears  to  have  a  sensor  or  grey  code 
problem  which  causes  the  noisy  output  signal.  However,  the  actual  and  measured 
motion  in  the  pitch  direction  are  small.  It  should  be  noted  that  these  tests 
were  performed  open-loop  and  thus  no  on-line  feedback  corrections  were  made  to 
compensate  for  modeling  or  hardware  errors.  It  is  anticipated  that  the  closed-loop 
control  capability  for  the  ASTREX  structure  will  exist  in  the  calendar  year  1994 

The  motion  in  the  yaw  direction  is  approximately  16%  short  of  the  specified  50 
degree  maneuver;  this  could  have  been  caused  by  a  number  of  factors.  If  the  inertia 
used  in  the  design  model  was  smaller  than  the  actual  inertia  of  the  structure,  a 
smaller  angle  change  would  be  expected.  The  hardware  cables  are  suspended  from 
the  structure;  this  produces  cable  drag,  a  rotational  spring-like  force  in  the  yaw 
direction,  as  the  cables  are  pulled  away  from  their  equilibrium  position.  A  cable- 
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follower  mechanism  attempts  to  compensate  for  this  problem;  while  the  magnitude 
of  the  cable- follower  induced  disturbances  are  reduced,  the  cable- follower  dynamics 
adds  additional  complexity  in  modeling  the  disturbances  acting  on  the  structure. 
This  uncompensated  cable  drag  phenomena  would  also  produce  smaller  motion  in 
the  yaw  direction  in  addition  to  a  small  angular  velocity  which  would  remain  about 
the  yaw  axis  as  the  structure  returned  to  its  equilibrium  position.  A  final  cause  of 
the  under- rot  at  ion  problem  is  known  to  be  due  to  low  tank  pressure  near  the  end 
of  the  maneuver.  Figure  6  shows  the  thrust  commanded  to  each  individual  thruster 
in  volts,  this  graph  is  identical  to  the  output  from  the  control  law  design  except  for 
the  conversion  of  thrust  to  volts. 
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Figure  6  -  Bang-Bang  Open-Loop  Experimental  8  Lb.  Commanded 
Thruster  Profile  on  the  ASTREX  Test  Article 


Figure  7  shows  the  output  thrust  at  the  nozzle  of  each  thruster.  The 
degradation  of  the  thrust  on  the  first  two  sets  of  thrusters  can  be  seen  beginning 
around  18  seconds,  where  the  output  profile  becomes  piecewise  linear  and  decreases 
in  comparison  with  the  smooth  commanded  thrust.  Although  the  degradation  is 
not  severe,  it  is  definitely  present.  At  low  pressures,  the  solenoid  valves  behave  in 
a  poorly-modeled  nonlinear  fashion,  especially  evident  when  the  valves  are  being 
closed.  Notice  the  lack  of  left-right  symmetry  on  all  six  final  “braking”  pulses  of 
Figure  7. 
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Figure  7  -  Bang-Bang  Open-Loop  Experimental  8  Lb.  Actual 
Thruster  Profile  on  the  ASTREX  Test  Article 
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Figure  8  -  Bang-Bang  Open-Loop  Experimental  Tank 
Pressure  Profile  on  the  ASTREX  Test  Article 

Figure  8  shows  the  tank  pressure  profile  in  pounds  per  square  inch.  It  is  noted 
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that  thrust  deterioration  for  the  8-pouiid  thrusters  occurs  as  the  tank  pressure  falls 
below  150  psi  at  18  seconds.  Notice,  comparing  Figures  7  and  8,  that  the  relatively 
most  significant  thruster  anomalies  occurred  at  tank  pressures  well  below  150  psi 
(i.e.  the  final  15  seconds  of  the  maneuver). 


Bang-OfF-Bang  Experimental  Results 

The  second  set  of  experimental  results  was  performed  using  the  bang-off-bang 
open-loop  control  law  in  conjunction  with  the  set  of  four  200-pound  thrusters  and 
two  8-pound  thrusters.  The  specified  maneuver  was  a  150  degree  yaw  maneuver, 
with  the  8-pound  thrusters  limited  to  three  pounds  each  and  the  200-pound 
thrusters  limited  to  50  pounds  each  for  fuel  and  safety  reasons.  Figure  9  shows 
the  gimbal  angles  verses  time  for  the  second  set  of  experimental  results. 
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Gimbal  angle*:  Roll,  Pitch,  Yaw  degroM 


Figure  9  -  Bang-Off-Bang  Open-Loop  Experimental  Angle  Profile 

on  the  ASTREX  Test  Article 


This  figure  shows  that  a  yaw  angular  rotation  of  only  32  degrees  was  accom¬ 
plished  from  a  required  50  degrees.  The  yaw  angular  velocity  at  the  end  of  the 
maneuver  was  in  the  direction  opposite  of  the  maneuver;  this  appears  to  be  the 
result  of  cable  drag.  The  roll  angle  was  again  oscillatory  but  small  and  the  pitch 

sensor  exhibits  the  same  noise  characteristics. 

Figure  10  shows  the  commanded  voltage  to  the  set  of  200-pound  thrusters. 
Each  200-pound  thruster  consists  of  two  components  which  fire  in  opposite  direc¬ 
tions  and  are  measured  and  controlled  separately. 
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The  output  pressure  measured  at  the  nozzle  of  the  200-pound  thrusters  is  shown 
as  Figure  11.  This  figure  illustrates  how  the  two  components  of  each  thruster  work  ^ 

in  unison  to  produce  the  positive  and  negative  components  of  the  input  signal. 

Although  the  reproduction  of  the  input  signal  does  not  deteriorate  near  the  end  of 
the  maneuver,  some  anomalous  pressure  leakage  is  evident. 

Figure  12  shows  the  commanded  voltage  levels  to  the  8-pound  thruster  set 
which  is  used  in  conjunction  with  the  200-pound  thrusters  to  provide  controllability. 

This  figure  shows  that  the  first  two  sets  of  8-pound  thrusters  are  zero  since  they 
have  been  replaced  by  the  200-pound  thrusters.  The  third  set  of  8-pound  thruster 
commands  are  shown  as  the  two  lower  plots. 
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Figure  12  -  Bang-OfF-Bang  Open-Loop  Experimental  8  Lb.  Commanded 
Thruster  Profile  on  the  ASTREX  Test  Article 

The  pressure  sensor  at  the  nozzle  of  the  8-pound  thrusters  is  shown  as  figure 
13.  The  first  two  sets  of  readings  show  that  these  thrusters  are  firing  although  they 
have  been  commanded  to  be  off.  This  phenomena  may  be  the  result  of  electrical 
feedback  within  the  hardware.  Again,  the  third  set  of  8-pound  thrusters  have  output 
deterioration  near  the  end  of  the  maneuver  beginning  at  10  seconds. 

The  final  experimental  figure  (Figure  14)  shows  the  tank  pressure  verses  time. 
It  is  noted  that  at  10  seconds,  where  the  8-pound  thruster  degradation  begins,  the 
tank  pressure  has  fallen  below  150  psi.  Figure  14  shows  the  tank  pressure  verses 
time  for  the  bang-off-bang  control  law.  It  should  be  noted  that  during  the  coast 
period,  the  rate  of  pressure  loss  is  approximately  zero.  This  is  the  characteristic 
of  the  bang-off-bang  control  law  which,  of  course,  that  saves  fuel.  The  fact  that 
there  is  a  measurable  negative  slope,  however,  indicates  that  significant  leakage  is 
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CONCLUSIONS 

A  torque-sliaped  maneuver  approach  for  a  spacecraft  in  three-dimensions  has  ^ 

been  developed  and  demonstrated  to  work  extremely  well  using  open-loop  and 
closed-loop  simulations  for  the  bang-bang  and  the  bang-off-bang  maneuvers.  In 
each  case  tested,  the  open-loop  tracking  error  was  essentially  zero;  the  only  errors 
introduced  in  the  simulation  were  due  to  integration  and  interpolation  errors.  The 
closed-loop  Lyapunov  tracking  control  law  drove  large  initial  tra,cking  errors  to  ^ 

essentially  zero  within  a  few  seconds  and  kept  errors  negligible  until  the  final  time 
using  simulations  where  only  initial  condition  errors  were  introduced.  Additionally, 
only  modest  degradation  of  this  performance  resulted  when  significant  model 
errors  were  introduced  into  the  simulation,  the  Lyapunov  tracking  control  law 
compensated  for  the  model  errors  and  initial  condition  errors,  and  again  regulated 
the  tracking  error  to  essentially  zero  by  the  final  time.  Hence,  the  Lyapunov  tracking  • 

controllers  were  shown  to  be  robust  with  respect  to  modeling  errors  and  initial 

condition  errors.  ,  ,  , 

The  experimental  portion  of  this  research  showed  some  positive  results,  how¬ 
ever,  also  revealed  are  several  hardware  problems  likely  to  be  resolved  with  fu¬ 
ture’ evolution  of  this  experimental  facility.  The  experimental  open-loop  maneuvers  ^ 

showed  the  same  general  trends  as  the  simulated  data  although  they  differed  in  mag¬ 
nitude.  This  discrepancy  appears  to  have  been  caused  by  an  underestimation  of  the 
mass  of  the  structure  and  some  unmodeled  effects  due  to  solenoid  valve  nonUnear- 
ities.  Secondary  problems  are  apparent  in  modeling  the  gimbal  and  cable-follower 
dynamics.  Simulated  maneuvers  using  an  increase  in  mass  of  10%  on  the  open  and 
closed-loop  simulations  were  performed.  The  experimental  data  exhibits  similar 
open-loop  characteristics  to  the  simulated  data  with  a  mass  error.  This  problem 
was  easily  compensated  for  in  simulation  by  closing  the  control  loop.  Closed-loop  ex¬ 
perimental  results  are  not  yet  available  due  to  current  system  hardware  limitations, 
mainly,  the  angular  rate  measurements.  Also,  a  significant  number  of  unexplained 
anomalies  were  encountered  in  the  experimental  results;  however,  these  may  be  # 

considered  typical  of  the  early  experimental  “shakedown”  of  such  a  complicated 
electromechanical  system. 

The  ASTREX  test  results  also  revealed  some  actuator  problems  generating 
the  commanded  thrust  profiles  using  the  8-pound  thrusters  near  the  end  of  the 
maneuver  when  the  tank  pressure  dropped  below  150  psi.  This  problem  stems  ^ 

from  the  fact  that  the  cold  gas  thrusters’  solenoid  valves  were  designed  assuming  a 
constant  back  pressure  of  500  psi.  Our  results  suggest  that  the  design  specification 
of  500  psi  is  quite  conservative;  the  thrusters  operate  reliably  down  to  175  psi 
using  low  thrust  commands.  With  the  present  pressurized  gas  supply  system,  very 
low  tank  pressures  (j  150  psi)  routinely  occurred  because  the  tanks  can  only  be  ^ 

pressurized  between  maneuvers.  The  thrust  generation  problem  could  be  handled 
by  performing  maneuvers  that  only  require  only  a  very  smaU  amount  of  fuel  and 
thus  maintain  a  tank  pressure  above  150  psi,  however,  these  small  angle  maneuver 
are  less  interesting  and  remove  many  of  the  nonlinear  issues  of  intent  from  the 
system  dynamics.  Another  problem  was  the  support  system  in  the  yaw  direction 
which  was  caused  by  to  the  natural  equilibrium  position  of  the  structure  and  a 
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disturbance  torque  due  to  cable  drag  and  cable-follower  dynamics.  These  two 
phenomena  could  also  cause  the  open-loop  experimental  maneuver  to  fall  short  of 
the  required  final  yaw  angle  as  well  as  causing  the  yaw  angle  to  drift  back  towards 
its  starting  orientation  upon  completion  of  the  open-loop  torque  profile.  Each  of 
these  problems  can  be  handled  with  rigorous  modeling  before  deriving  the  open- 
loop  control  law  or  by  using  feedback  compensation  with  appropriate  sensing  system 

enhancements.  ....  j  • 

The  goal  of  this  paper,  to  extend  the  near-mmimum-time  maneuver  design 

technique  to  three-dimensions,  was  accomplished.  The  simulated  results,  both  open- 

loop  and  closed-loop,  were  excellent  and  the  preliminary  experimental  tests  showed 

promising  results. 
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Abstract 

A  new  family  of  orientation  parameters  derived  from  the  Euler  parameters  is  presented.  They 
arc  found  by  a  general  stereographic  projection  of  the  Euler  parameter  constraint  surface,  a  four- 
dimensional  unit  sphere,  onto  a  three-dimensional  h^rplane.  The  r^ulting  set  of  Aree 
stereographic  parameters  have  a  low  degree  polynomi^  non-lineanty  in  the  wrrespondmg 
Idnematic  equations  and  direction  cosine  matrix  parameterization.  The  stereogr^hic  parameters 
are  not  unique,  but  have  a  set  of  “shadow”  parameters.  These  “shadow”  parameters  are  gener^ly 
numerically  different,  yet  represent  the  same  physi^  orientation.  Using  the  ongmal 
stereogrsphic  psrHmcteis  combined  with  their  shadow  set  it  is  possible  to  establish  a  set  of  three 
parameters  which  can  describe  any  rotation  without  a  singularity,  yet  vdth  one  discontinuity.  Tbe 
symmetric  stereographic  parameters  are  ideal  to  describe  departure  motions,  since  they  can  be 
chosen  such  that  they  are  nonsingular  for  up  to  a  principal  rotation  of  ±360®.  The  asymmetnc 
stereographic  parameters  are  well  suited  for  describing  the  kinematic  of  spinning  bodies,  since 
they  only  go  singular  when  oriented  at  a  specific  aiigle  about  a  s^ific  axis.  A  globally  replar 
and  stable  control  law  using  symmetric  stereogriiphic  parameters  is  presented  which  can  bring  a 
spinning  body  to  rest  in  any  desired  orientation  without  backtracking  the  motion. 

Introduction 


While  the  Euler  parameters  (quaternions)  describe  an  arbitrary  orientation  without  a 
singularity,  they  form  a  once-redundant  set.  The  following  development  studies  a  method  to 
stereographically  project  the  Euler  parameters  onto  a  three-dimensional  hyperplane  and  form  a 
family  of  sets  of  three  parameters  called  the  stereographic  parameters.  This  study  is  motivated  by 
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earlier  work  done  by  Marandi  and  Modi  [1],  Tsiotias  [2]  and  Shuster  [3].  In  particular,  Marandi 
and  Modi  introduce  a  set  of  three  parameters  similar  to  the  Rodrigues  parameters  (singular  at  a 
principal  rotation  of  <b  =  ±180“).  which  move  the  singularity  out  to  a  principal  rotation  0  of 
±360“!  Marandi.  Modi  and  Tsiotias  describe  this  modified  set  of  Rodrigues  parameters  as  the 
result  of  a  stereograpMc  projection  of  a  four-dimensional  unit  sphere  onto  a  three-dimensional 
hyperplane.  This  paper  wUl  explore  the  stcreographic  projection  idea  further  and  m  a  more 
generaUzed  way.  and  show  that  both  the  classical  Rodrigues  parameters  and  the  ModiA'siotras 
modified  Rodrigues  parameters  can  be  considered  a  special  case  of  the  general  symmetric 
stereographic  parameters.  Indeed,  the  method  presented  can  be  used  to  construct  a  set  of  three 
symmetric  stereographic  parameters  which  have  their  singular  point  anywhere  between  a 
principal  rotation  of  0“  and  360“.  or  to  construct  a  set  of  three  asymmetric  stcreographic 
parameters  which  have  their  singular  point  determined  by  both  a  principal  angle  and  an  axis  of 
rotation.  Analogous  to  the  Euler  parameters,  the  stcreographic  parameters  arc  generally  not 
unique.  The  Euler  parameters  time  variation,  for  any  physical  motion,  generate  a  trajectory  on  the 
surface  of  the  unit  sphere  constraint  surface.  The  reflection  of  the  Euler  parameters  (reveismg  all 
parameters  signs)  generates  a  second  trajectory  on  the  opposite  of  the  sphere  which  corresponds 
to  the  same  physical  rotation.  Each  set  of  stcreograpMc  parameters  has  a  set  of  “shadow 
parameters”  which  correspond  to  the  reflection  set  of  Euler  parameters.  These  “shadow” 
stcreographic  parameters  are  generally  numerically  different  from  the  original  parameters,  yet 
physically  parameterize  the  same  rotation.  The  developments  presented  herein  are  of  significant 
academic  importance;  using  stcreographic  projections  it  is  easy  to  visualize  the  singularities  of 
tWs  infinite  family  of  three  parameter  sets  which  include  the  classical  and  modified  Rodrigues 

parameters. 

The  modified  Rodrigues  parameters,  as  introduced  by  Marandi  and  Modi,  are  studied  in 
further  detail,  since  they  present  the  largest  range  of  non-singular  rotations  for  the  symmetric 
stereographic  parameters.  In  combination  with  the  corresponding  set  of  “shadow  parameters”,  a 
globally  regular  and  non-singular  Lyapunov  attitude  control  is  established  in  feedback  form. 

The  Euler  Parameter  Unit  Sphere 

The  four  Euler  parameters  are  well  known  and  well  studied  in  the  literature.  They  can  be 
developed  directly  from  Euler’s  principal  rotation  theorem  [3,4].  The  angle  <I>  is  the  principal 


rotation  angle  and  the  unit  vector  f  is  the  principal  line  of  rotation. 

Po  =  COS^  p,  =  ti-sia-  i  =  1.2.3  (1) 

=  Pj+P?+P2+P3  =  ^ 

The  four  Euler  parameters  P,-  abide  by  the  holonomic  constraint  given  in  equation  (2).  This 
equation  describes  a  four-dimensional  unit  sphere.  The  Euler  parameter  trajectories  on  this  sphere 
completely  describe  any  possible  rotational  motion  without  any  singularities  or  discontinuities. 

Note  that  the  Euler  parameters  are  not  unique.  The  mirror  image  trajectory  -p(r)  descnbes 
the  identical  rotational  motion  as  p(0-  The  negative  sign  means  the  rotation  is  accomplished 
about  a  principal  axis  of  the  opposite  direction  through  the  negative  principal  angle.  Usually  this 
non-uniqueness  does  not  pose  any  difficulties  since  both  sets  have  identical  properties,  correspond 
to  the  same  physical  orientation,  and  can  be  solved  uniquely  once  initial  conditions  are 

prescribed. 

Because  the  Euler  parameters  satisfy  one  holonomic  constraint,  they  form  a  once  redundant 
set  of  equations.  Three  parameters  are  sufficient  to  describe  a  general  rotation.  However,  the 
problem  with  any  set  of  three  parameters  is  that,  as  is  well  known,  singularities  will  occur  at 
certain  orientations.  Different  three-parameter  sets  distinguish  themselves  by  having  different 
geometric  interpretations  and,  especially,  having  their  singular  behavior  at  different  onentations. 
Also  of  significance,  most  three-parameter  sets  introduce  transcendental  nonlinearities  into  the 
parameterization  of  the  direction  cosine  matrix  and  related  kinematical  relationships.  However, 
the  classical  Rodrigues  parameters  and  other  sets  discussed  herein  involve  low  degree  polynomial 
nonlinearities  in  both  the  direction  cosine  matrix  and  associated  kinematical  differential  equation, 
without  approximation.  The  Euler  parameter  description  represents  an  attractive  regularization 
which  has  no  singularity,  at  the  cost  of  having  one  extra  variable. 

Stereographic  Projection  of  the  4D  unit  Sphere 

If  a  minimum  parameter  representation  is  desired,  the  four  Euler  parameters  can  be  reduced  to 
any  parameter  set  of  three  by  an  appropriate  transformation.  For  example,  the  3-1-3  Euler  angles 


or  the  Rodrigues  parameters  are  very  commonly  used  sets  that  are  easily  transformed  from  the 
Euler  parameters  [3,4].  Marandi,  Modi  and  Tsiotras  found  a  set  of  modified  Rodrigues  parameters 
by  means  of  a  stereographic  projection  of  the  four-dimensional  unit  sphere  onto  a  three- 
dimensional  hypeq)lane.  To  describe  the  stereographic  projection,  imagine  a  three-dimensional 
sphere  being  projected  onto  a  two-dimensional  plane  (analogous  to  the  Earth  map  projection 
problem).  A  certain  point  is  chosen  in  the  3D  space  called  a  projection  point  Next  a  2D  mapping 
plane  is  chosen.  Every  point  on  the  unit  sphere  is  then  projected  onto  the  mapping  plane  by 
drawing  a  line  from  the  projection  point  through  the  point  on  the  unit  sphere  and  intersected  with 

the  mapping  plane. 


Fig.  1.  Illustration  of  a  Symmetric  Stereographic  Projection  onto  Hyperplane 

Orthogonal  to  po  ^xis. 

Figure  1  shows  only  a  2D  to  ID  stereographic  projection  to  keep  the  illustration  simple.  The 
results  though  can  easily  be  expanded  to  a  four-dimensional  sphere  since  the  axes  are  orthogonal 
to  each  other.  Figure  1  shows  a  2D  unit  circle  getting  projected  onto  a  mapping  line.  With  all  these 
projections  the  Euler  parameter  Pq  eliminated,  since  the  mapping  hyperplane  normal  is  the  Po 
axis.  They  are  called  symmetric  projections  since  the  principal  angle  range  is  symmetric  about  the 
zero  rotation  angle.  Asymmetric  stereographic  projections  are  projections  onto  a  hyperplane  with 


a  normal  other  than  the  Pq  axis,  which  do  not  have  a  symmetric  principal  angle  range.  The  case 
where  the  Euler  parameter  pi,  P2  or  P3  is  eliminated  is  discussed  later  tn  this  paper. 

Placing  the  projection  point  on  the  Pq  axis  yields  an  even  principal  angle  range  about  the  zero 
rotation  point.  The  mapping  line  is  placed  a  distance  of  +1  from  the  projection  point  The 
parameters  are  scaled  by  this  arbitrary  distance,  so  having  a  distance  of  2  between  the  projection 
point  and  the  mapping  plane  would  simply  scale  all  the  parameters  by  a  factor  of  2. 

Keep  in  mind  that  the  Euler  parameters  ate  defined  in  terms  of  half  of  the  principal  rotation 
angle  O.  The  point  (1,0)  on  the  circle  corresponds  to  a  zero  rotation.  The  point  (0,1)  corresponds 
to  a  +180°  rotation.  Studying  Ftg.  1  it  becomes  evident  that  the  reduced  parameters  go  off  to 
infinity  when  a  point  on  the  circle  is  projected  which  lies  directly  in  the  plane  perpendicular  to  the 
Po  axis  through  the  projection  point  The  two  lines  that  need  to  be  intersected  are  parallel  to  each 
other,  causing  the  intersecdon  point  to  move  to  infinity.  The  corresponding  principal  rotation 
obviously  yields  the  angle  at  which  the  reduced  set  of  parameters  wUl  go  singular!  By  placing  the 
projection  point  at  different  locations  on  the  Pq  maximum  principal  rotation  angle  is 

varied.  If  the  projection  point  is  outside  the  unit  circle,  no  singularity  will  occur,  but  the  projectron 
is  no  longer  one-to-one.  Some  areas  of  the  mapping  wUl  start  to  overlap  in  the  projection  plane. 
Clearly  this  is  not  a  desirable  feature  because  of  the  ambiguity  tWs  lack  of  uniqueness  would 
introduce  (given  the  projected  coordinates,  we  cannot  uniquely  orient  the  reference  frame). 

The  angle  O5  is  the  principal  angle  of  rotation  where  the  stereographic  parameter  vector  g 
encounters  a  singularity.  This  angle  O5  determines  the  placement  of  the  projection  point  a. 


The  transformation  from  the  Euler  parameters  to  a  general  set  of  three  symmetric 
stereographic  parameters  g  is  defined  as: 

^  1,2,3  (4) 

Po-" 

The  condition  for  a  syrrunetric  stereographic  parameter  singularity,  evident  in  equation  (4),  is 


shown  below. 
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(5) 


If  a  <  1  this  condition  is  satisfied  at  an  infinite  set  of  orientations.  If  the  projection  point  is  on 
the  unit  sphere  surface,  then  a  =  -I  and  a  singularity  is  only  achieved  at  O  =  ±360^- 


Pn  = 


1+C^C 


(6) 


P.-  =  c, 


i  =  1.2,3 


The  inverse  transformation  ficom  the  general  stereographic  parameters  C  to  the  Euler 
parameters  p,  is  given  in  equation  (6).  This  equation  holds  for  both  the  symmetric  and  asymmetric 
stereographic  projections. 

Since  the  Euler  parameters  are  not  unique,  it  is  valid  to  rewrite  equation  (4)  in  terms  of  -p^. 
For  the  general  case  these  new  stereograpluc  parameters  ^  correspond  to  the  iiurror  image  of  the 
Euler  parameters  and  are  generally  not  numerically  equal  to  C  of  equation  (4).  However,  the 
resulting  vector  will  describe  the  same  orientation  as  the  original  parameters  and  are  herein 
referred  to  as  the  “shadow  points”  of  C  and  are  denoted  with  a  superscript  S: 


(7) 


Using  equation  (6)  the  shadow  point  can  be  expressed  directly  as  a  transformation  of  the 
original  parameters  C  and  the  projection  point  a  as: 


a  +  2a 


(8) 


The  fact  that  the  shadow  point  vector  tf  generally  has  a  different  behavior  than  the  original  ^ 
will  be  useful  when  describing  a  rotation.  The  family  of  stereographic  parameters  generally  has 
two  distinct  sets  of  parameters,  corresponding  to  p  (t)  and  -p  (i) ,  which  describe  the  identical 
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rotation  and  are  related  to  one  another  by  equation  (8). 

The  differential  kinematic  equations  for  C  ^  found  by  differentiating  equation  (4). 


PfpQ 


(9) 


By  making  use  of  the  differential  kinematic  equations  of  the  Euler  parameters  [4]  given  as; 


Po 

P0-P1-P2-P3 

'o' 

Pi 

1 

Pi  Po  “^3  Pz 

®l 

P2 

2 

P2  P3  Po  -Pi 

0)z 

P3 

P3  “Pz  Pi  Po. 

3- 

(10) 


and  the  basic  definition  of  the  stereogtjq)hic  parameters  given  in  equation  (4),  the  differential 
kinematic  equations  for  the  stereographic  parameters  can  be  found.  Their  general  form  is  very 
lengthy  and  not  shown  here  due  to  space  limitations.  The  most  important  special  cases  are 
discussed  below. 

Viewing  Fig.  1,  it  becomes  evident  that  a  set  of  three  symmetric  stereographic  parameters 
cannot  have  the  singularity  point  moved  beyond  a  principal  rotation  of  ±360°.  Going  beyond 
±360°  would  mean  finding  a  projection  point  that  would  map  the  entire  unit  sphere  mote  than 
once,  i.e.  not  a  one-to-one  map  onto  the  projection  plane.  Therefore  the  symmetric  parameters  are 

better  suited  for  regulator  or  “moderately  large”  departure  motion  problems,  than  for  spinning 

} 

body  or  large  angle  maneuver  cases. 

Note  that  for  the  2ero  principal  rotation,  the  asymmetric  stereographic  parameters  are  not 
equal  to  zero.  The  projection  of  the  Po  paranieter  onto  Pj  =  a  +  1  is  not  zero  because  Po  is  one  at 
the  zero  principal  rotation. 

Asynunetric  stereographic  parameters  have  a  qualitatively  different  singular  behavior  from 
the  symmetric  stereographic  parameters.  The  Euler  parameter  Pq  contains  information  about  the 
principal  rotation  angle  only  (i.e.,  the  direction  of  e  does  not  affect  Pq).  Eliminating  Po  during  a 
symmetric  projection  causes  the  singularity  to  appear  at  a  certain  principal  rotation  angle. 
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independent  from  the  principal  axis  of  rotation  t.  Since  for  the  symmetric  projections,  the  zero 
rotation  point  (1,0,0,0)  lies  on  the  po  axis  and  the  singularity  occurs  at  ±<l»s.  we  have  a  symmetric 
range  of  nonsingular  principal  rotations  {-Og  <  O  <  44)5}  about  the  zero  rotation,  regardless  of 

the  direction  of  e. 


Fig.  2.  Illustration  of  a  Asynwnetric  Stercogiaphic  Projection  onto  H3rperplane 

Orthogonal  to  Pj  axis. 

For  an  asymmetric  projection,  one  of  the  Euler  parameters  Pi,  P2.  ot  P3  ^  eliminated.  Each 
one  of  these  parameters  contains  information  about  both  the  principal  rotation  angle  and  the 
direction  of  e.  Therefore  singularities  will  only  occur  at  certain  angles  about  the  i-th  axis 
(corresponding  to  pi).  Figure  2  Ulustrates  an  asymmetric  stereographic  projection  where  pi  is 
eliminated.  All  possible  projections  points  a  now  lie  on  the  pi  axis,  and  the  mapping  hyperplane 
perpendicular  to  pj  is  defined  at  Pj  =  a+1.  Since  the  zero  rotation  is  no  longer  in  the  center  of  the 
nonsingular  principal  angle  range,  the  valid  range  of  principal  angles  is  non-symmetric.  A 
singularity  will  occur  at  Ogi  or  052.  where  these  two  principal  angles  ate  unequal  in  magnitude. 
Given  a  singular  principal  rotation  angle  Ogj  which  lies  between  ±180°,  the  corresponding 
projection  point  a  is  defined  as: 


9 


The  second  singular  principal  rotation  angle  <I>s2  is  then  found  as: 


The  iranstorroadon  from  Euler  parameters  to  the  corresponding  asymmetric  steieographic 
parameters  is  the  same  as  given  in  equation  (4).  with  (1„  and  fc  switched.  A  singularity  now  occurs 
when  Pi  equals  a.  If  the  projection  point «  lies  inside  the  four-dimensional  unit  sphere,  this  may 

occur  at  several  orientations. 


e.-sin-  =  a 


Using  equation  (1).  the  condition  for  a  singularity  becomes  equation  (13).  where  the  index  / 
stands  for  the  pi  parameter  which  was  eliminated.  Since  the  sine  function  is  bounded  between  ±1. 
a  singularity  will  ne^er  occur  if  \e},<a.  If  the  projection  point  a  is  moved  to  the  sphere  surface, 
namely  to  ±1.  then  a  singularity  may  occur  with  a  rotation  about  the  i-th  body  axis  only!  The 
reason  for  this  is  evident  in  equation  (12).  Since  a  is  ±1  and  the  sine  function  is  bounded  within 
±1.  the  only  way  equation  (13)  is  satisfied  is  if  =  1 .  Because  e  is  a  unit  vector,  the  other  two 
direction  components  must  be  zero  if  =  1.  Thus  if  the  body  is  spinning  about  an  axis  other 
than  the  i-th  body  axis,  a  singularity  will  never  occur.  Therefore  these  asymmetric  stereographic 
parameters  are  attractive  for  spinning  body  problems,  where  an  object  is  rotating  mainly  about  a 
certain  axis.  The  principal  rotation  angle  is  now  not  bounded  as  with  the  symmetric  stereographic 
parameters,  but  can  grow  beyond  ±360°.  Simply  choose  the  normal  of  the  projection  hyperplane 
to  be  far  removed  from  the  rotation  axis  and  place  the  projection  point  a  on  the  four-dimensional 
unit  sphere  surface  and  the  probability  of  encountering  a  singularity  is  virtually  nil. 

For  both  the  symmetric  and  asymmetric  stereographic  parameters,  having  the  projection  point 
on  the  sphere  surface  means  the  singularity  can  only  occur  at  two  distinct  orientations.  If  the 
projection  point  lies  inside  the  sphere,  there  generally  exists  an  infinite  set  of  possible  singular 

orientations. 
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The  inverse  transformation  from  asymmetric  steieographic  parameters  to  Euler  parameters  is 
the  same  as  given  in  equation  (6).  These  asymmetric  parameters  also  exhibit  the  same  shadow 
point  behavior  as  the  symmetric  parameters  do  with  the  same  transformation  given  in  equation 
(8).  Therefore,  if  a  singular  orientation  is  approached  with  the  asymmetric  stcreographic 
parameters,  one  can  switch  to  the  shadow  point  to  avoid  the  singularity. 

Classical  Rodrigues  Parameters 

The  Rodrigues  parameters  q  have  a  singularity  at  O  =  ±180°.  This  corresponds  to  a  point  on 
the  two-dimensional  unit  circle  in  Rg.  1  of  (04:1).  The  corresponding  symmetric  stereographic 
projection  has  the  projection  point  a  at  the  origin  and  the  mapping  line  at  po  =  1-  1^  becomes 
evident  why  the  classical  Rodrigues  parameters  must  go  singular  at  O  =  ±180°  when  descnbmg 
them  as  a  special  case  of  the  symmetric  stereogrrqphic  parameters.  The  transformation  from  the 
Euler  parameters  to  the  Rodrigues  parameters  q  is  found  by  setting  <i>s  =  ±180°  in  equation  (3-4). 
The  well  known  result  is  shown  in  equation  (14)  below. 


i  =  1.2,3 


(14) 


The  inverse  transformation  from  the  Rodrigues  to  the  Euler  parameters  is  found  by  using  the 
same  O5  in  equation  (6)  and  is  given  as: 


I 

./l  +^q 


i  =  1.2.3 


(15) 


The  differential  kinematic  equation  in  terms  of  the  classical  Rodrigues  parameters  is  given  in 
vector  form  as: 


S 


i(©-  {(ss]q  +  q^q) 


(16) 


An  explicit  matrix  form  of  equation  (16)  is  given  below  [4]. 


II 


1 

2 


1+9?  9i92“93  9i93+92 

92^1 +  93  *■‘■^2  9293  ~9t 

19391-92  9392  +  91  1+9?  J 


(17) 


Using  the  definitions  of  the  Euler  parameters  in  equation  (1).  the  Rodrigues  parameteis  can 
also  be  expressed  directly  in  terms  of  the  principal  rotation  angle  O  and  the  pnnapal  line  of 

rotation  g. 


q  =  stan¬ 


ds) 


From  equation  (18),  it  is  obvious  why  the  classical  Rodrigues  parameters  go  singular  at 
±180°.  For  completeness  the  direction  cosine  matrix  C  is  given  in  explicit  matrix  form  [4]: 


l+9?  +  9?+9? 


1+9? -92 -9?  2(9i92  +  93) 

2(9i92  -  93)  1-9?  +  92  -  93  7(^293  +  9i) 
2(9391+92)  2(9293-9i)  1-9? -9? +9? 


(19) 


and  in  vector  form  [3]: 


Cig)  =  — L- ((1-/9)1+299  -2 t9l) 

I  +  /9 


(20) 


Equation  (20)  and  its  inverse  can  also  be  written  as  the  Cayley  Transform  [3,4,6]: 


C(9)  =  (/-  [9])  (/+  [9I)  ‘ 


(21a) 


[9]  =  (/-C)(/+Q 


-I 


(21b) 


and  the  kinematic  differential  equation  shown  in  equations  (16-17)  has  the  (Zayley  form  [4]. 


^[91  =  i(/-[9l)l“Hl-l9l) 

at  L 


(22) 
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The  tilde  matrix  is  defined  by  -  x ...]  as  given  in  equation  (23). 


[q\ 


0  92I 

-92  9l  0 


(23) 


Let  the  vector  /  (defined  with  -g)  denote  the  shadow  point  of  the  classical  Rodrigues 
parameters.  Solving  equation  (6),  or  starting  with  equation  (14),  the  following  definition  for  the  / 
is  found. 


-Po'Po"’' 


/  =  1.2,3 


(24) 


Equation  (24)  shows  that  for  the  Rodrigues  parameters,  the  shadow  point  vector  components 
are  to  the  original  Rodrigues  parameters,  with  identical  values  and  properties.  Therefore 

the  shadow  point  concept  is  of  no  practical  consequence  in  this  casej  the  classical  Rodrigues 
parameters  are  unique! 


Fig.  3.  Original  and  “Shadow  Point”  Projection  of  the  Classical  Rodrigues  Parameters. 

Having  the  projection  point  a  at  the  origin  causes  this  elegant,  degenerate  phenomenon. 
Figure  3  illustrates  why  both  sets  of  Rodrigues  parameters  are  identical.  The  classical  Rodrigues 
parameters  are  the  only  syrrunetric  stereographic  parameters  which  exhibit  this  lack  of  distinction 


between  the  original  parameters  and  their  shadow  point  counterparts,  as  is  evident  below.  This 
proves  simultaneously  to  be  an  advantage  and  a  disadvantage. 

Modified  Rodrigues  Parameters 

The  modified  Rodrigues  parameters  presented  by  Modi  and  Tsiotras  move  the  projection  point 
to  the  far  left  of  the  unit  sphere  at  (-l.O.O.O)  and  project  the  Euler  parameters  onto  the  hyperplane 
at  po  =  0-  ^  singularity  as  far  away  from  the  zero-rotation  as  possible.  The 

parameters  will  now  go  singular  at  =  ±360».  As  Tsiotras  points  out,  this  new  set  of  parameters 
is  able  to  describe  any  type  of  rotation  except  a  complete  revolution  back  to  its  original 
orientation.  Carrying  out  the  stereographic  projection  with  O5  =  +360^  the  transformation  from 
Euler  parameters  to  the  modified  Rodrigues  parameter  vector  g  and  the  inverse  transformation  are 

given  as: 


o,  = 


Pi 


'  l+Pfl 


i  =  1,2,3 


(25) 


^0=17^  - 


1+0^0 


i  =  1.2,3 


(26) 


Using  equation  (1)  again,  the  modified  Rodrigues  parameters  can  be  written  as  [2]: 


<I> 

c  =  etan  — 
"  '  4 


(27) 


This  formula  immediately  reveals  the  singularity  at  a  principal  rotation  of  ±360°,  double  the 
range  of  the  classical  Rodrigues  parameters.  It  is  interesting  that  O  =  0°  and  O  =  ±360° 
correspond  physically  to  the  same  body  orientation.  This  fact  has  both  theoretical  and  practical 

consequences  in  “avoiding”  the  singularity. 


0  = 


[Ci)]0  +  0Cp 


(28) 


The  kinematic  differential  equations  in  terms  of  g  are  given  in  equation  (28).  They  are  very 


similar  to  equadoti  (16)  except  for  one  extra  term.  This  terms  makes  the  equations  only  slightly 
mote  complicated,  but  not  any  more  non-linear. 

The  explicit  matrix  form  for  the  elements  of  equation  (28)  is  given  as  [2]: 


(l+o^-o^-o?) 

2(0,03-03) 

2(0,03  +  03) 

0, 

2  (030, +  03) 

(l-oj  +  o^-o^) 

2(0303-0,) 

©3 

2(030,-03) 

2(0303+0,) 

(l-oj-o^  +  o^) 

“3 

The  direction  cosine  matrix  in  terms  of  the  modified  Rodngues  parameters  [2]  can  be  shown  to 
be: 


C(g)  = 


(i+g'^sy)' 


80,03-402!: 


4(o5-o5-o^) +x?  80,02+403!: 

80,03—403!:  4(-o^  +  o^-o^)  +!?  80203  +  4o,E 

80,03 + 403!:  -  4o,I  4  (-0^  -  o^ + o^)  + 

X=  1-g^g 


(30) 


or  more  compactly  in  vector  form  as  [3]: 


C(g)  =  I- 


4(1 -g^g) 


lo]+- 


(l+g^g)  (1+g^g) 


l<Tl 


(31) 


The  modified  Rodrigues  parameter  vector  g  is  transformed  into  classical  Rodrigues 
parameters  as: 


(32) 


Naturally,  this  transformation  goes  singular  at  a  principal  rotation  of  ±180°,  because  |lgl|  ->  1 
3Xlcl  II  ^11  — ^  ^  O— >il80^. 

Comparing  equation  (27)  and  equation  (18)  it  is  immediately  evident  that  both  the  classical 
and  the  modified  Rodrigues  parameter  vectors  have  the  direction  of  the  principal  rotation  vector  #, 
but  a  different  magnitude.  The  transformation  from  modified  to  classical  Rodrigues  parameters 


shown  in  equation  (32)  can  be  rewritten  in  terms  of  the  principal  angle  of  rotation  O. 


9  = 


o 

tzn- 

tan  — 
4 


(33) 


Using  the  image  set  -g(0  of  Euler  parameters,  the  shadow  point  of  the  modified  Rodrigues 
parameter  vector  <j  is  found. 


(34) 


Contrary  to  the  classical  Rodrigues  parameters,  these  modified  Rodrigues  parameter  shadow 
points  are  not  numerically  equal  to  the  original  parameters.  While  they  generate  exactly  the  same 
direction  cosine  matrix,  they  are  not  generally  a  mirror  image  of  one  another.  While  generally 
note  that  everywhere  on  the  unit  sphere  =  I  that,  in  fact,  =  -g  =  -p,-  Tlus  simple 
observation  has  significant  practical  consequences. 


Fig.  4.  Original  and  “Shadow  Point”  Projection  of  the  Modified  Rodrigues  Parameters. 
The  shadow  points  g^  have  some  interesting  properties.  They  go  singular  at  the  zero  rotation 
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and  go  to  zero  at  a  ±360°  prindpal  rotation!  Hiia  is  the  exaa  opposite  of  the  qualitative  behavior 
of  ff.  The  reason  for  this  behavior  becomes  evident  in  Fig.  4.  At  a  zero  rotaUon,  the  shadow  point 
wiU  intersect  the  mapping  Une  at  infinity.  At  a  rotafion  of  ±180°  the  shadow  points  will  be  fte 
negafive  of  their  ori^  values.  We  note  that  s'  is  disfinguished  ftom  s  merely  for  book-keeping 
purposes.  Transfotming  initial  conditions  (from  iq  or  g)  for  any  pven  case,  could  initiate  moUon 

on  either  a  (r)  or  (0  • 

Using  s  together  with  the  shadow  vector  it  is  possible  to  describe  any  rotation  without 
singularities  and  with  only  three  parameters,  but  with  one  discontinuity  at  the  switching  point  If 
the  original  a(r)  trajectory  approaches  the  singularity  at  <I>  =  ±360^  the  vector  c(r)  can  be 
switched  to  the  shadow  trajectory  /(t) .  This  transformation  is  very  simple  as  is  seen  in  equation 
(34).  Rather  than  waiting  until  lo(01  ~  or  |a^(r)l  « to  switch,  however,  the  most  convenient 

switching  surface  is  the  g^s  =  I  sphere;  the  unit  sphere  which  corresponds  to  a  principal  rotation 
of  ±180°.  The  Euler  parameter  is  zero  everywhere  on  this  sphere.  This  causes  the  shadow  pomt 
to  have  the  same  unit  magnitude  as  the  original  with  the  transformation  being  /  =  -i?.  Thus 

whenever  g(r)  exits  (enters)  the  unit  sphere,  /(O  enters  (exits)  at  the  opposite  side  of  the  sphere. 


Fig.  5.  Illustration  of  the  Original  and  Shadow  Modified  Rodrigues  Parameter, 


Switching  at  the  ^5  =  I  surface  can  be  very  elegandy  accomplished  when  finding  c  by 
extracting  the  Euler  parameters  from  the  direction  cosine  matrix.  Simply  keep  and  the 
resulting  set  of  parameters  will  always  have  ^  I  [1].  Switching  on  the  po  =  0  sphere  (where 
Jg  =  =  1)  keeps  the  combined  set  of  original  and  shadow  points  bounded  within  the  unit 

sphere. 

This  bounded  behavior  of  the  combined  set  is  illustrated  in  Fig.  5  above.  The  grey  line 
represents  the  ff(r)  trajectory  and  the  black  fine  the  corresponding  shadow  trajectory  of  5^(0. 
The  motion  starts  out  at  a  zero  rotation  with  the  grey  line  at  the  origin  and  the  black  line  at 
infinity.  After  a  whUe  the  principal  angle  of  the  object  grows  beyond  180"  and  the  grey  Une  exits 
the  unit  sphere.  At  the  same  time  tive  shadow  parameters  (black  line)  enter  the  sphere  at  the 
opposite  position.  If  the  body  rotates  back  to  the  original  orientation,  the  shadow  parameters 
approach  zero  as  the  original  parameters  go  off  to  infinity.  Any  tumbUng  motion  would  give  rise 
to  a  qualitatively  identical  discussion  of  ff(0  and  (0 . 

Example  of  Asymmetric  Stereographic  Parameters 

A  sample  set  of  asymmetric  stereographic  parameter  vector  g  is  constructed  by  eliminating 
the  Euler  parameter  pi  and  setting  a  equal  to  -1.  Adjusting  equation  (4),  the  vector  g  is  defined  as: 


^i 


(35) 


Using  equation  (1 1,12)  the  singular  principal  rotations  about  the  positive  pi  axis  become  ^»si 
=  -180°  and  Osi  =+540°.  As  mentioned  earlier,  the  direction  at  which  a  singular  orientation  is 
approached  is  important  with  asymmetric  stereographic  parameters.  Here  a  negative  pnncipal 
rotation  of  180°  about  the  first  body  axis  causes  a  singularity.  A  positive  principal  rotation  of  180 
would  yield  an  identical  physical  position,  yet  causes  no  singularity.  Only  after  a  +540°  does  this 
representation  go  singular,  even  though  this  position  is  the  same  as  +180°.  This  non-symmetnc 
principal  angle  range  is  due  to  the  fact  that  the  zero  rotation  point  (±1 ,0,0,0)  does  not  lie  on  the  pi 

axis. 

Differentiating  equation  (35)  and  using  equation  (10),  the  differential  kinematic  equation  for 
vector  g  is  found  to  be: 


(36) 


(-1-3?  +  B2  +  B?) 

2(niB3-B2) 

-2(3,32+B3) 

0, 

1 

B  =  4 

2(Tl3-Tli32> 

2(32^3  +  B,) 

(-1+3|-B2+^3) 

®2 

-2(3,33 +n2) 

(l-3j-n2+B?) 

2(3,-B2B3) 

“3. 

Note  that  equation  (36)  contains  no  transcendental  functions  in  it  and  is  similar  qualitatively  to 
equation  (29).  Because  tj  is  an  asymmetric  stereographic  parameter  vector,  however,  there  is  less 
symmetry  in  the  matrix.  This  lack  of  synunetry  is  Unked  with  the  absence  of  a  symmetric 
principal  rotation  angle  range.  Therefore,  equation  (36)  cannot  be  written  in  a  more  compact 
vector  as  was  the  case  with  the  symmetric  stereographic  parameters. 

The  direction  cosine  matrix  in  terms  of  3  can  be  found  to  be: 


+1:^  8^1^3+41123:  -811,112+41133: 

-8t1j113  +  4i123:  4(11^  +  112-113) -3?  8il2il3+4il,3:  ^^7) 

81IJII2+41I33:  8TI2II3-41IJ2:  4(il5-il2+^3^ 

3:=  1-3^3 

Analogously,  asymmetric  stereographic  parameters  could  be  derived  by  projecting  onto  a 
hyperplane  orthogonal  to  the  P2  o*"  Ps  actually  any  non-Po  axis.  All  these  parameters 

would  have  a  similar  singular  behavior. 

To  illustrate  the  use  of  the  asymmetric  stereographic  parameters  3  for  describing  a  spinning 
body,  a  sample  motion  was  generated.  The  motion  was  achieved  by  forcing  the  following  3-1-3 
Euler  angle  time  history  upon  the  body. 

ej(0=/  =  (l-cos20^  Ojto  =  (Sin2f)^  (38) 

The  body  is  mainly  spinning  about  the  third  body  axis  while  oscillating  about  the  other  two. 
Therefore  the  stereographic  parameter  vector  3  will  never  go  singular,  since  a  singularity  can  only 
occur  with  a  pure  rotation  about  the  first  body  axis. 


C(B)= - '—2 

(1+3  3) 
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As  Fig.  6  shows,  the  asymmetric  stereographic  parameters  3  are  smooth  and  continuous  at  all 
time.  The  sample  motion  shown  performs  one  and  a  half  revolutions  without  encountering  any 
singularity. 


Fig.  7.  Comparison  of  Symmetric  and  Asymmetric  Stcreographic  Parameters. 

To  compare  the  asymmetric  with  the  symmetric  stereographic  parameter  description  for  this 
spinning  body,  the  polar  plot  in  Fig.  7  was  generated.  The  magnitude  of  each  parameter  vector  is 
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plotted  versus  the  principal  rotation  angle  <j>.  As  expected,  the  symmetric  stercographic  parameters 
go  singular  at  certain  <j>.  while  the  vector  g  is  bounded  at  all  tinres. 

Figure  8  shows  the  time  history  of  the  principal  rotation  angle  <j)  for  this  spinning  body 
maneuver.  Because  of  the  oscillations  about  the  first  and  second  body  axis,  4>  gets  reduced  during 
some  portions  of  the  maneuver.  Because  the  magnitude  of  the  symmetric  stercographic 
parameters  depends  only  on  the  principal  rotation  angle,  these  “backing  up"  phases  arc  not  visible 
on  the  polar  plot  in  Fig.  7.  However,  the  magnitude  of  the  asymmetric  stercographic  parameters 
depends  on  both  the  principal  rotation  angle  and  the  direction  of  the  principal  rotation  axis.  This 
explains  the  more  irregular  features  of  the  jq|  plot  in  Fig.  7. 


Fig.  8.  Principal  Rotation  Angle  Time  History  of  Spinning  Body  Maneuver 

While  some  loss  in  symmetry  and  elegance  of  the  equations  results,  asymmetric  sets  of 
stercographic  parameters  are  able  to  represent  the  motion  of  a  spinning  body  without  switching 
between  the  shadow  and  the  original  parameters,  like  the  modified  Rodrigues  parameters  would 
require.  In  [7]  Tsiotras  develops  a  set  of  orientation  parameters  which  arc  also  well  suited  for  the 
spinning  body  problem  and  have  a  low  polynonual  degree  nonlinearity  in  their  kinematic 
equations.  They  differ  in  form  to  the  asymmetric  stereographic  parameters,  but  are  similar  in 
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Globally  Stable  Control  using  Modified  Rodrigues  Parameters 

• 

The  combiacd  set  of  modified  Rodrigues  parameters  and  their  shadow  counterparts  lead 
themselves  very  well  for  regulator  type  control  design.  Adopting  the  switching  surface  =  I 
has  a  suq)rising  benefit  in  designing  control  laws.  Consider  the  dynamics  of  a  generally  tumbling 
rigid  body.  The  Lyapunov  function  • 


g) 


=  i^Ow  +  2faog(l  +g^g) 


(39) 


will  not  have  any  discontinuities  at  the  switching  surface,  since  both  the  original  g  and  its 
shadow  point  have  unit  magnitude  there!  V^(^,  g)  is  by  inspection  only  zero  if  both  co  and  g 
are  zero.  As  a  consequence,  it  is  easy  to  establish  a  globally  stable  Lyapunov  controller  with  a 
three  rotation  parameter  set  which  never  encounters  a  singularity'!  J  in  equation  (39)  denotes  the 
3x3  inerda  matrix  in  body  axis.  The  scalar  K  is  a  positive  feedback  gain.  For  this  nonlinear 
regulator  type  problem,  the  external  control  torque  u  is  found  by  setting  the  time  derivative  of 
equation  (39)  equal  to 


V  = 


(40) 


with  P  being  a  positive  definite  matrix,  and  using  equation 
motion: 


(28)  and  Euler’s  equation 


of 


70  =  -  (<ol  7co  +- 1/ 


(41) 


to  solve  for  the  torque  u.  Using  the  logarithm  of  g^g  in  equation  (39)  results  in  a  globally 
nonlinear  control  law  u  which  is  linear  in  g  [2]. 

u  =  ~  Kg  +  {ci)l  7^  (42.) 


The  control  law  in  equation  (42)  is  valid  for  any  arbitrary'  departure  motion  g.  Conventional 
sets  of  tlurec  parameters  would  encounter  singular  orientations.  Another  problem  with 
conventional  parameter  sets  is  that  they  have  no  inherent  n>cchanism  to  accommodate  tumbling 
situations  when  the  object  has  performed  a  principal  rotation  beyond  ±180"^  away  from  the  desired 
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state.  When  this  happens,  it  would  probably  be  desirable  to  “help”  the  object  complete  the 
revolution,  rather  than  to  attempt  to  force  it  back  the  way  it  came.  The  only  set  of  parameters  that 
can  “almost”  handle  this  scenario  is  the  classical  set  of  Rodrigues  parameters.  They  fail  because 
they  go  singular  near  the  “up-side-down”  orientation  at  <^=±m\  The  combined  set  of  g  and 
however,  are  well  behaved  up  to  and  well  beyond  <l>  =  ±180^  Switching  at  /sy  =  1  makes  it 
possible  for  the  control  law  to  let  the  object  go  past  the  “up-side-down”  orientation  and  then  let  it 
rotate  back  to  the  origin  the  short  way,  as  we  illustrate  in  an  example  below. 

The  angular  velocity  ©  feedback  is  required  for  global  stability,  and  the  P  matrix  should  be 
chosen  to  achieve  satisfactory  damping  of  the  nonlinear  oscillations. 

The  results  of  a  single-axis  spin  maneuver  using  the  control  law  in  equation  (43)  are 
presented.  The  inertia  J  used  was  12000  kgm^;  the  feedback  gmns  were  chosen  as  K=300  and 
P=1800.  Initial  angular  velocity  was  +607s.  Rgure  9  below  shows  the  time  Wstory  of  the 
principal  angle  of  rotation.  The  object  clearly  spins  beyond  the  “up-side-down”  point  of  <I>=+180° 
and  then  returns  back  to  the  origin  by  continuing  the  motion  and  completing  the  revolution.  The  © 
feedback  sufficiently  dampens  the  system  to  prevent  excessive  oscillations  about  the  origin. 


The  angular  velocity,  shown  in  Fig.  10,  decreases  steadily  from  +607s  and  converges  to  zero. 
Where  the  O  goes  beyond  180°  there  is  a  discontinuity  in  the  slope  of  ©. 


23 


Fig.  10.  Angular  Velocity  of  Spin  Maneuver. 


The  corresponding  external  control  torque  is  presented  in  Fig.  1 1.  A  large  torque  is  demanded 
initially  because  of  the  laige  initial  angular  velocity  ©.As  ©  decreases,  so  does  the  torque.  There 
is  a  discontinuity  where  the  modified  Rodrigues  parameter  switch  from  the  original  to  the  shadow 
point  trajectory.  This  is  because  the  position  error  g  reversed  its  sign,  driving  the  object  towards 
the  origin  about  the  other  way.  However,  the  control  torque  does  not  jump  to  a  negative  value 
because  of  the  ©  feedback.  It  keeps  the  torque  positive;  i.e.  the  controller  is  still  slowing  down  the 
spin,  even  during  the  switching. 


% 


% 


time  [s] 

Fig.  11.  External  Control  Torque  of  Spin  Maneuver. 

The  position  error  and  the  associated  torque  discontinuity  due  to  switching  to  the  shadow 
trajectory  may  be  troublesome  for  highly  flexible  bodies.  However,  this  is  easily  addressed  in 
practice  by  replacing  the  instantaneous  switch  by  a  smooth  one.  Also,  introducing  a  simple  digital 
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filter  will  effectively  smooth  out  such  jump  discontinuities. 

It  is  conceptually  easy  to  introduce  a  reference  trajectoty  and  design  analogous  tracking-type 
feedback  control  with,  using  the  methods  of  [4],  global  stability  guaranteed.  This  is  useful  in 
achieving  global  control  stuping,  and  also  to  permit  selection  of  feedback  gains  sufficiently  large 
to  reject  disturbances. 

Conclusion 

A  new  family  of  stereographic  parameters  has  been  presented,  including  the  general 
transformation  from  and  to  the  Euler  parameters.  The  general  stereographic  parameters  are  not 
unique  and  have  a  corresponding  set  of  shadow  point  parameters  whose  singular  behavior  is 
different  from  the  original  parameters. 

The  classical  Rodrigues  parameters  are  a  special  set  of  the  symmetric  stereogr^hic 
parameters  where  the  original  parameters  and  their  shadow  points  coincide.  The  modified 
Rodrigues  parameters  are  also  a  special  case  of  the  symmetric  stereographic  parameters.  They 
have  the  largest  non-singular  principal  angle  range  of  ±360®.  Their  associated  shadow  points  are 
singular  at  the  zero  rotation  and  zero  and  O  =  ±360°.  This  combined  set  of  stereographic 
parameters  and  their  shadow  point  parameters  are  able  to  describe  any  rotation  without 
encountering  a  singularity,  but  with  one  discontinuity. 

The  asymmetric  stereographic  parameters  have  their  singular  orientations  defined  both  by  an 
axis  and  a  principal  rotation  angle.  The  two  singular  angles  do  not  have  equal  magnitude  as  with 
the  symmetric  stereographic  parameter.  Asymmetric  parameters  do  allow  rotations  beyond  ±360° 
and  are  therefore  attractive  to  spinning  body  type  problems. 

The  globally  stable  control  law  presented  implicitly  "knows”  when  an  object  has  rotated 
beyond  ±180°  from  the  target  state,  and  to  let  it  complete  the  revolution  back  to  the  desired  state. 
This  control  implicitly  seeks  out  the  smallest  principal  rotation  angle  to  the  target  state.  This 
control  law  was  developed  by  making  use  of  the  modified  Rodrigues  parameter  and  their  shadow 
points. 
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OUTLINE 

Introduction  and  Motivation 
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Euler^s  Principal  Rotation  Theorem 

A  rigid  body  (ref.  frame)  can  be  brought  from  an  arbitrary 
initial  orientation  to  an  arbitrary  final  orientation  by  a  single 
rotation  (O)  about  a  principal  line  (e). 
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Well,  so  what?  How  does  all  of  this  generalize 
to  something  other  than  rigid  body  dynamics  ? 

Consider  the  following  theorem*  due  to  Cayley  (Cayley  Transform): 
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immediately  generates  the  classical  Rodrigues  parameters  equation 
for  the  direction  cosine  matrix  and  the  inverse  thereof  Hmmmm! 
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Kinematics  ofnxn  Rotations 

We  can  show*  that  nxn  orthogonal  matrices  evolve  according  to 
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Spectral  Parameterization  ofnxn  Symmetric  Matrices 

An  arbitrary  symmetric  positive  matrix  P  has  the  spectral  decomposition 

P-CKC^,  k  =  diag{'k\,  C^C  =  I 
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In  particular,  ifP(t)  satisfies  a  differential  equation,  then,  we  can  transform 
the  differential  eqn.  for  P(t)  into  new  eqns.for  \(t)  and  Q(t),  This  time 
varying  spectral  decomposition*  seems  elegant  and  attractive^  An  example  . 


Example:  Transformation  of  Matrix  Riccati  Equation 

A  frequently  occuring  diff.  eqn.  in  optimal  control  and  optimal  estimation  is  the 
Matrix  Riccati  Equation: 
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These  eqns  may  look  messy,  and  they  are!  They  also  behave  poorly  near  repeated 
eigenvalues  ofP,  but  they  are  attractive  conceptually! 


Macroscopic  Properties  of  n-Dimensional 
Parameterizations  of  Orthogonal  Matrices 

station  i  generaljzatigns _ 
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Concluding  Remarks 

Parameterizations  of  3x3  orthogonal  projection  matrices 

Classical  Rigid  Body  Principal  Rotation  Coordinates 

Euler  Parameters,  Rodrigues  Parameters,  Modified  Rod.  Parameters 
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Approach  for  Parameterization  of  nxn  Positive  Definite  Matrix 

Covariance  Matrices,  Weight  Matrices,  Solutions  of  Riccati  Eqns, ... 

Thank  you  Cayley  et  al,  for  leaving  me  a  few  fun  things  to  do! ! 
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Principal  Rotation  Representations 
of  Proper  NxN  Orthogonal  Matrices 


Hanspeter  Schaub 
Panagiotis  Tsiotras 
John  L.  Junkins 


Abstract 

Three  and  four  parameter  representations  of  3x3  orthogonal  matrices  are  extended  to  the  gen- 
•  eral  case  of  proper  NxN  orthogonal  matrices.  These  developments  generalize  the  classical  Ro¬ 

drigues  parameters,  the  Euler  parameters,  and  the  recently  introduced  modified  Rodrigu^ 
eters  to  higher  dimensional  spaces.  The  developments  presented  are  motivated  by,  and  signifi¬ 
cantly  generalize  and  extend  the  classical  result  known  as  the  Cayley  transformation. 


•  Introduction 

It  is  well  known  in  rigid  body  dynamics,  and  many  other  areas  of  Euclidean  analysis,  that  the 
rotational  coordinates  associated  with  Euler’s  Principal  Rotation  Theorem  [1,2,3]  lead  to  espe- 

0  daily  attractive  descriptions  of  rotational  motion.  These  parameterizations  of  proper  orthogonal 

3x3  matrices  include  the  four-parameter  set  known  widely  as  the  Euler  {quaternion)  parameters 
[1,2,3],  as  well  as  the  classical  three-parameter  set  known  as  the  Rodrigues  parameters  or  Gibbs 
vector  [1,2,3,4].  Also  included  is  a  recently  introduced  three  parameter  description  known  as  the 

•  modified  Rodrigues  parameters  [4,5,6].  As  we  review  briefly  below,  these  parameterizations  are 
of  fundamental  significance  in  the  geometry  and  kinematics  of  three-dimensional  motion. 
Briefly,  their  advantages  are  as  follows: 

^  Euler  Parameters:  This  once  redundant  four-parameter  description  of  three-dimensional  rota¬ 

tional  motion  maps  all  possible  motions  into  arcs  on  a  four-dimensional  unit  sphere.  This  accom¬ 
plishes  a  regularization  and  the  representation  is  universally  nonsingular.  The  kinematic  differen¬ 
tial  equations  contain  no  transcendental  functions  and  are  bi-linear  without  approximation. 

^  Classical  Rodrigues  Parameters:  This  three  parameter  set,  also  referred  to  as  the  Gibbs  vec¬ 

tor,  is  proportional  to  Euler’s  principal  rotation  vector.  The  magnitude  is  tan{^l2),  with  (j)  being 
the  principal  rotation  angle.  These  parameters  are  singular  at  <|)  =  ±7t  and  have  elegant,  quadrati- 
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cally  nonlinear  differential  kinematic  equations. 

Modified  Rodrigues  Parameters:  This  three  parameter  set  is  also  proportional  to  Euler’s  prin¬ 
cipal  rotation  vector,  but  with  a  magnitude  of  tani^l4).  The  singular  orientation  is  at  <J)  =  ±27t,  dou¬ 
bling  the  principal  rotation  range  over  the  classical  Rodrigues  parameters.  They  also  have  a  quad¬ 
ratic  nonlinearity  in  their  differential  kinematic  equations. 

The  question  naturally  arises;  can  these  elegant  principal  rotation  parameterizations  be  ex¬ 
tended  to  orthogonal  projections  in  higher  dimensional  spaces?  Cayley  partially  answered  this 
question  in  the  affirmative;  his  “Cayley  Transform”  fully  extends  the  classical  Rodrigues  parame¬ 
ters  to  higher  dimensional  spaces  [1,2,7].  A  proper  NxN  orthogonal  matrix  can  be  generally  para¬ 
meterized  by  a  vector  with  dimension  M  =  ViN(N-l).  Only  for  the  3x3  case  is  N  equal  to  M.  Any 
proper  orthogonal  matrix  has  a  determinant  of  +1  and  can  be  interpreted  as  analogous  to  a  rigid 
body  rotation  representation.  This  paper  extends  the  classical  Cayley  transform  to  parameterize  a 
proper  NxN  orthogonal  matrix  into  a  set  of  M-dimensional  modified  Rodrigues  parameters.  Fur¬ 
ther,  a  method  is  shown  to  parameterize  the  NxN  matrix  into  a  once-redundant  set  of 
(M+l)-dimensional  Euler  parameters. 

The  first  section  wiU  review  the  Euler,  Rodrigues  and  the  modified  Rodrigues  parameters  for 
the  3x3  case,  generalized  later  in  this  paper  to  parameterize  the  proper  NxN  orthogonal  matrices. 
The  second  section  will  review  the  classical  Cayley  transform  resulting  with  the  representation  of 
a  proper  orthogonal  matrix  using  the  Rodrigues  parameters,  followed  by  the  new  representation 
of  the  NxN  orthogonal  matrices  using  an  M-dimensional  set  of  modified  Rodrigues  parameters, 
and  finally,  a  new  representation  of  the  NxN  orthogonal  matrices  using  an  (M+l)-dimensional 
Euler  parameters. 


Review  of  Three-Diinensional  Rigid  Body  Rotation  Parameterizations 
The  Direction  Cosine  Matrix 

The  3x3  direction  cosine  matrix  C  completely  describes  any  three-dimensional  rigid  body  ro¬ 
tation.  The  matrix  elements  are  bounded  between  ±1  and  possess  no  singularities.  The  famous 
Poisson  kinematic  differential  equation  for  the  direction  cosine  matrix  is: 

C  =  -[a)]C  (1) 


where  the  tilde  matrix  is  defined  as 


[d)]  = 


0 

0)3 

-0)2 


-0)3 

0 

0)1 


0)2 

-0)1 

0 


(2) 
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The  direction  cosine  nwtrix  C  is  orthogonnl,  therefore  it  satisfies  the  following  constmint. 

C^C  =  CC^=I  (3) 

This  constraint  causes  the  direction  cosine  matrix  representation  to  be  highly  redundant.  In¬ 
stead  of  considering  all  nine  matrix  elements,  it  usually  suffices  to  parameterize  the  matrix  into  a 
set  of  three  or  four  parameters.  However,  any  minimal  set  of  three  parameters  will  contain  singu¬ 
lar  orientations. 

The  constraint  in  equation  (3)  shows  that  besides  being  orthogonal,  the  direction  cosine  matrix 
is  also  normal  [8].  Consequendy  it  has  the  spectral  decomposition 

C=UAU*  (4) 

where  17  is  a  unitary  matrix  containing  the  orthonormal  eigenvectors  of  C,  and  A  is  a  diagonal 
matrix  whose  entries  are  the  eigenvalues  of  C.  The  *  symbol  stands  for  the  adjoint  operator, 
which  takes  the  complex  conjugate  transpose  of  a  matrix.  Since  C  represents  a  rigid  body  rota¬ 
tion,  it  always  has  a  determinant  of  +1 . 

The  Principal  Rotation  Vector 

Euler’s  principal  rotation  theorem  states  that  in  a  three-dimensional  space,  a  rigid  body  (refer¬ 
ence  frame)  can  be  brought  from  an  arbitrary  initial  orientation  to  an  arbitrary  final  orientation  by 
a  single  principal  rotation  (<)))  about  a  principal  line  e  [3]. 
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With  reference  to  Fig.  1,  the  body  axis  Bi  components  of  the  principal  line  e  are  identical  to 
the  spatial  components  projected  onto  w,- . 

|e2}=l  =  Ce  (5) 

Therefore  e  most  be  an  eigenvector  of  the  3x3  C  matrix  with  a  corresponding  eigenvalue  of 
+1.  If  the  3x3  C  matrix  has  an  eigenvalue  of  -1,  the  matrix  represents  a  reflection,  not  a  proper  ro¬ 
tation  and  the  principal  rotation  theorem  does  not  hold.  In  this  case  the  det(C)  would  be  +1.  The 
principal  rotation  vector  y  is  defined  as; 

Y  =  <()e 

Let  us  now  consider  the  case  where  a  rigid  body  performs  a  pure  single-axis  rotation  about  the 
fixed  i.  This  rotation  axis  is  identical  to  Euler’s  principal  line  of  rotation  e.  Let  the  rotation  angle 
be  ({).  The  angular  velocity  vector  for  this  case  becomes; 

d)  = 

or  in  matrix  form; 

[cb]  =  <j>[e] 


Substituting  equation  (8)  into  (1),  one  obtains  the  following  development. 

dt  dt 


dC 


=  -[e]C 


C  = 


(9) 


The  last  step  foUows  since  the  [e]  matrix  is  constant  during  this  single  axis  maneuver.  Due  to 
Euler’s  principal  rotation  theorem,  however,  any  arbitrary  rotation  can  always  be  described  instan¬ 
taneously  by  the  equivalent  single-axis  principal  rotation.  Hence  equation  (9)  will  hold  at  any  in¬ 
stant  for  an  arbitrary  time-varying  direction  cosine  matrix  C.  However,  (])  and  e  must  be  consid¬ 
ered  time-varying  functions.  Using  the  following  substitution 

[?]  = 


equation  (9)  can  be  rewritten  as  [2] 


Instead  of  using  an  infinite  matrix  power  series  expansion  of  equation  (11)  to  find  C,  the  ele¬ 
gant  finite  transformation  shown  below  can  be  used  [2].  That  is,  the  evaluation  of  e  does  not 
require  the  spectral  decomposition  of  [7],  but  can  be  written  directly  in  term  of  y  itself.  Unfortu¬ 
nately,  this  transformation  only  holds  for  the  3x3  case.  A  general  transformation  for  the  NxN 
case  is  unknown  at  this  point,  at  least  as  far  as  the  authors  know. 

g“[Yl  =/cos<j)- [e]sin(J)-ee^(cos(})  — 1)  q2 

(l)  =  llYll.  ^=Y/4> 

To  find  the  inverse  transformation  from  the  direction  cosine  matrix  C  to  [y]  ,  the  matrix  loga¬ 
rithm  can  be  taken  of  equation  (1 1)  to  obtain 

(Y]=-logC  =  Xi(/-C)”  (13) 

Using  the  spectral  decomposition  of  C  given  in  equation  (4),  the  above  equation  can  be  rewrit¬ 
ten  as 

[y]  =  -  log(t/At/* )  =  -  C/(logA)t/*  (14) 

where  calculating  the  matrix  logarithm  of  a  diagonal  matrix  becomes  trivial.  Since  all  eigen¬ 
values  of  an  orthogonal  matrix  have  unit  norm,  the  matrix  logarithm  in  equation  (14)  is  defined 
everywhere  except  when  an  eigenvalue  is  -1.  Generally,  equation  (14)  will  return  a  [y]  which 
corresponds  to  a  principal  rotation  angle  <|)  in  (-180°,+ 180°).  Note  however,  that  when  C  has  ei¬ 
genvalues  of  -1,  equation  (14)  does  not  return  a  skew-symmetric  matrix.  The  transformation 
breaks  down  here  for  this  singular  event.  The  geometric  interpretation  is  that  a  180  rotation  has 
been  performed  about  one  axis  (leading  to  one  positive  and  two  negative  eigenvalues  of  C),  which 
is  the  only  rotation  not  covered  by  the  domain  of  equation  (14). 

The  principal  vector  representation  of  C  is  not  unique.  Adding  or  subtracting  27c  from  the  prin¬ 
cipal  rotation  angle  (j)  describes  the  same  rotation.  As  expected,  equation  (11)  will  always  yield 
the  same  C  matrix  for  the  different  principal  rotation  angles,  since  all  angles  correspond  to  the 
same  physical  orientation.  However,  the  inverse  transformation  given  in  equation  (14)  yields  only 
the  principal  rotation  angle  which  lies  between  -180°  and  +180°. 

As  do  all  minimal  parameter  sets,  the  principal  rotation  vector  parameterization  has  a  singular 
orientation.  The  vector  is  not  uniquely  defined  for  a  zero  rotation  from  the  reference  frame.  The 
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principal  rotation  vector  parameterization  will  be  found  convenient,  however,  to  derive  useful  rela¬ 
tionships. 

The  Euler  (Quaternion)  Parameters 

The  Euler  parameters  are  a  once-redundant  set  of  rotation  parameters.  They  are  defined  in 
terms  of  the  principal  rotation  angle  (j)  and  the  principal  line  components  e.  as  follows. 

Po=cos|,  p,  =gisin|  I  =1,2,3  (15) 


They  satisfy  the  holonomic  constraint; 


Po  +  Pi  +  Pi  +  Pa  -  1 


(16) 


Equation  (16)  states  that  all  possible  Euler  parameter  trajectories  generate  arcs  on  the  surface 
of  a  four-dimensional  unit  hypersphere.  This  behavior  bounds  the  parameters  to  values  between 
±1.  However,  the  Euler  parameters  are  not  unique.  The  mirror  image  trajectories  P(t)  and  -P(t) 
both  describe  the  identical  physical  orientation  histories.  Given  a  3x3  orthogonal  matrix,  there 
will  be  two  corresponding  sets  of  Euler  parameters  which  differ  by  a  sign.  The  Euler  parameters 
are  the  only  set  of  rotation  parameters  which  have  a  bi-linear  system  of  kinematic  differential 
equations  [1],  other  than  the  direction  cosine  matrix  itself,  as  follows 


fPol 

P2 

iPaJ 


■Po  -Pi  -P2  -p3‘ 

rO  I 

1 

Pi  Po  “P3  P2 

.“1  . 

2 

P2  P3  Po  “pi 

0)2 

-P3  -P2  Pi  Po- 

vCu3 

(17) 


It  is  also  of  significance  that  the  above  4x4  matrix  is  orthogonal,  so  “transportation"  between 
0).’s  and  p,-  ‘s  is  “painless".  The  direction  cosine  matrix  in  term  of  the  Euler  parameters  is  [1,3] 


rp^+p?-pi-p3 


[C]  = 


2(PiP2-PoP3) 

.2(piP3+P0p2) 


2(PiP2  +P0P3) 
P§-Pl+P2-P3 
2(P2P3-PoPi) 


2(Plp3-p0p2) 

2(P2p3  +P0P1) 


(18) 


The  Euler  parameters  have  several  advantages  over  all  minimal  sets  of  rotation  parameters. 
Namely,  they  are  bounded  between  ±1,  never  encounter  a  singularity,  and  have  linear  kinematic 
differential  equations  if  the  CDj(t)  are  considered  known.  All  of  these  advantages  are  slightly  offset 
by  the  cost  of  having  one  extra  parameter. 
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The  Classical  Rodrigues  Parameters 

The  classical  Rodrigues  parameter  vector  q  can  be  interpreted  as  the  coordinates  resulting 
from  a  stereographic  projection  of  the  four-dimensional  Euler  parameter  hypersphere  onto  a 
three-dimensional  hyperplane  [6],  with  the  projection  point  at  the  origin  and  the  stereographic 
mapping  hyperplane  at  Pq  =  +1  •  As  discussed  in  [6],  it  follows  that  they  have  their  singular  orien¬ 
tation  at  a  principal  rotation  angle  of  <{>  =  ±180®  from  the  reference.  Their  transformation  from 
the  Euler  parameters  is 

i=  1,2,3  (19) 

po 

Unlike  the  Euler  parameters,  the  Rodrigues  parameters  are  unique.  The  q-  uniquely  define  a 
rotation  on  the  open  range  of  (-180®, +180®)  [6];  as  is  evident  in  equation  (19),  reversing  the  sign 
of  the  Euler  parameters  has  no  effect  on  the  q..  Using  equation  (15),  the  classical  Rodrigues  pa¬ 
rameters  can  also  be  defined  directly  in  terms  of  the  principal  rotation  angle  and  the  principal  axis 
components  as 

qi  =  e,tan  ^  i  =  1, 2, 3  (20) 


It  is  apparent  that  q  has  the  same  direction  as  the  principal  rotation  and  the  magnitude  is 
tan{if/2) .  The  singular  condition  of  ()>  =  ±180®  is  evident  by  inspection  of  equation  (20).  The 
kinematic  differential  equation  for  the  Rodrigues  parameters  contain  a  quadratic  nonlinear  depen¬ 
dence  on  the  9^-.  They  can  be  verified  from  equations  (17,20)  to  be  [1-4] 


_1 

2 


1+^^  qyqi-qz  qiqi+qi 
qiqi+q^  1  +  92  9293-91 

-9391  -92  9392  +  91  1+93 


(21) 


Notice  that  the  above  coefficient  matrix  is  not  orthogonal,  although  the  inverse  is  well  be¬ 
haved  everywhere  except  at  (J)  =  ±180®  where  |^|  — > '«.  The  direction  cosine  matrix  in  terms  of 
the  Rodrigues  parameters  is  [1-4]: 


C(9)  = 


1 

l+9i +92+93 


■l+9i -92-93 
2(9291  -93) 

-  2(^3 +^2) 


2(9192+93)  2(^1^3-^2) 

l-9i +92-93  2(9293+91) 
2(9392-91)  1-91  -92  +  93- 


(22) 


The  Modified  Rodrigues  Parameters 

The  modified  Rodrigues  parameter  vector  d  is  also  a  set  of  stereographic  parameters,  closely 
related  to  the  classical  Rodrigues  parameters  [2,4-6].  The  modified  Rodrigues  parameters  have 
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the  projection  point  at  (-1,0,0,0)  and  the  stereographic  mapping  hyperplane  at  =  0.  This  projec¬ 
tion  results  in  a  set  of  parameters  which  do  not  encounter  a  singularity  until  a  principal  rotation 
from  the  reference  frame  of  ±360°  has  been  performed.  Therefore  they  are  able  to  describe  any 
rotation  except  a  complete  revolution  ±360°.  Their  transformation  from  the  Euler  parameters  is 


While  the  classical  Rodrigues  parameters  have  a  singularity  at  pQ=0  ((j)  =  ±180°),  the  modified 
Rodrigues  parameters  have  moved  the  singularity  out  to  a  single  point  at  Pq=-1  (4>  =  )• 

ure  2  below  illustrates  these  two  singular  conditions.  Since  the  classical  Rodrigues  parameters  ate 
only  defined  for  - 180°  <^<  + 180° ,  they  can  only  describe  rotations  on  the  upper  hemisphere 
of  the  four-dimensional  unit  hyper-sphere  where  Pq>0.  However,  the  modified  Rodrigues  parame¬ 
ters  can  describe  any  rotation  on  this  hypersphere  except  the  point  Therefore  the  modified 

Rodrigues  parameters  have  twice  the  nonsingular  range  as  the  classical  Rodrigues  parameters. 


Fig.  2.:  Illustration  of  the  Singular  Conditions  of  the  Classical  and 
the  Modified  Rodrigues  Parameters. 

Like  the  Euler  parameters,  the  modified  Rodrigues  parameters  are  not  unique.  They  have  an 
associated  “shadow”  set  found  by  using  -p(t)  instead  of  P(t)  in  equation  (23)  [5,6].  The  transfor¬ 
mation  from  the  original  set  to  the  “shadow”  set  is  [2,5,6] 

/  =  1,2,3  (24) 

6  6 


The  “shadow”  points  are  denoted  with  a  superscript  S  merely  to  differentiate  them  from  CT/ . 
Keep  in  mind  that  both  dand  describe  the  same  physical  orientation,  similar  and  related  to  the 
case  of  the  two  possible  sets  of  Euler  parameter  and  the  principal  rotation  vector.  It  turns  out  that 
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the  modified  Rodrigues  “shadow”  vector  o^(t)  has  the  opposite  singular  behavior  to  the  original 
vector  d(0 .  The  original  parameters  have  differential  kinematic  equations  which  are  very  linear 
near  a  zero  rotation  and  are  singular  at  a  ±360°  rotation.  On  the  other  hand,  the  “shadow  ’  parame¬ 
ters  have  differential  kinematic  equations  which  are  linear  near  the  ±360°  rotation  and  singular  at 
the  zero  rotation.  [6]  Using  equation  (15),  the  definition  for  the  modified  Rodrigues  parameters  in 
equation  (23)  can  be  rewritten  as  [4] 


Equation  (25)  is  very  similar  to  equation  (20),  except  for  the  scaling  factor  of  the  principal  ro¬ 
tation  angle.  The  singularity  at  ±360°  is  evident  in  equation  (25),  and  small  rotations  behave  like 
quarter  angles.  All  three  parameter  representations  must  possess  a  singularity.  This  set  max¬ 
imizes  the  nonsingular  principal  rotation  range  to  ±360°.  The  following  differential  kinematic 
equations  display  a  similar  degree  of  quadratic  nonlinearity  as  do  the  corresponding  equations  in 
terms  of  the  classical  Rodrigues  parameters  [4-6] 

ri+Oi-C2-03  2(aia2-cy3)  2(0103+02) 

6  =  ^  2(0201+03)  I-O1+O2-O3  2(0203-01) 

L  2(0301-02)  2(0302+01) 


Note  that  the  coefficient  matrix  of  the  differential  kinematic  equation  is  not  orthogonal,  but  al- 
most.  Multiplying  it  with  its  transpose  yields  a  scalar  (l+o^o)  times  the  identity  matrix.  As 
far  as  we  know,  this  is  the  only  three  parameter  representation  possessing  this  elegant  property; 
further  attesting  to  the  uniqueness  and  importance  of  the  modified  Rodrigues  parameterization. 
This  almost  orthogonal  behavior  allows  for  a  simple  transformation  between  the  ©,•  and  the  a,- 


C(a)  = 


(1  +  a^d)^ 


4(o| -o^ -03) +  E^  80102 +403E 


802O1  -403L 
8O3O1  +  402^ 


80i03-402£ 
4(-Oj +02 -03) +  L^  802O3+401E 
803O2-401Z  4(-o|-o^  +  o|)+2:^ 


(27) 


L  =  1-0^0 


The  direction  cosine  matrix  is  shown  above  [6,9].  It  has  a  slightly  higher  degree  of  nonlinear¬ 
ity  than  the  corresponding  direction  cosine  matrix  in  terms  of  the  classical  Rodrigues  parameters. 


Parameterization  of  Proper  NxN  Orthogonal  Matrices 

A  proper  orthogonal  matrix  is  an  orthogonal  matrix  whose  determinant  is  +1.  Some  aspects 
of  parameterizing  proper  NxN  orthogonal  matrices  into  M-dimensional  Rodrigues  parameters 
have  been  studied  recently  by  Junkins  and  Kim  [1]  and  Shuster  [2].  Keep  in  mind  that  M  = 
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ViNCN-l).  These  classical  developments,  generalizing  the  Rodrigues  parameters  to  NxN  proper 
rotation  matrices,  date  from  the  work  of  Cayley  [7]  and  are  included  below  for  comparative  pur¬ 
poses  with  the  new  representations. 

Any  NxN  orthogonal  matrix  abides  by  the  constraint  given  in  equation  (3).  This  equation  is  an 
exact  integral  of  equation  (1),  as  can  be  verified  by  differentiation  of  equation  (3)  to  obtain 

C^C+C^C  =  0  (28) 

The  C  matrix  defined  in  equation  (1)  can  be  shown  to  satisfy  this  condition  exactly.  Substi¬ 
tute  equation  (1)  into  (27)  and  expand  as  follows 

(-  [d)]C)^C+C^(-  [&]€)=  0 
(-  C^[a)f)C-C^[6j]C  =  0 
C^(-[d)f -[cc>])C  =  0 

The  above  statement  is  obviously  satisfied  if  [a>]  is  a  skew-symmetric  matrix,  e.g. 
[g)]  =  —  [m]^  .  Consequently  equation  (1)  will  generate  an  NxN  orthogonal  matrix,  as  long  as 
[m]  is  skew-symmetric  and  the  initial  condition  C(t=0)  is  orthogonal.  This  observation  allows 
for  the  evolution  of  NxN  orthogonal  matrices  to  be  viewed  as  higher  dimensional  direction  cosine 
matrices,  somewhat  analogous  to  the  motion  generated  by  a  “higher  dimensional  rigid  body  rota¬ 
tion,”  and  also  suggests  parameterization  of  of  higher  dimensional  rigid  body-motivated  rotation 
parameters. 

Higher  Dimensional  Classical  Rodrigues  Parameters 

Cayley’s  transformation  [7]  parameterizes  a  proper  orthogonal  matrix  C  as  a  function  of  a 
skew-synunetric  matrix  Q\  these  elegant  transformations  are 

C  =  (/-  !2)(/+  QT^  =  (/+  QT^  (/-  Q)  (29a) 

(2=(/-C)(/+C)'^  =(/+C)"^(/-C)  (29b) 

The  Cayley’s  transformation  is  one-to-one  and  onto  from  the  set  of  skew-symmetric  matrices 
to  the  set  of  proper  orthogonal  matrices  with  no  eigenvalues  at  -1.  Notice  the  remarkable  truth 
that  the  forward  and  inverse  transformations  are  identical.  The  transformation  in  equation  (29b) 
fails  if  any  of  the  eigenvalues  of  C  are  -1,  because  the  I+C  matrix  becomes  singular  and  is  thus 
not  invertible.  The  Cayley  transformation  in  equation  (29a)  produces  only  proper  orthogonal  ma¬ 
trices  C  with  de/fC)=+l.  This  can  be  verified  by  examining  the  determinant  of  C  as  shown  below. 
Using  equation  (29a),  det{ C)  can  be  expressed  as 
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det(C)  =  det(/-  e)det((/+  QT'  )  = 


Since  the  Q  matrix  is  skew-symmetric,  it  has  purely  imaginary  complex  conjugate  pairs  of  ei¬ 
genvalues  of  the  form  ±iX..  Let  /?  be  the  corresponding  eigenvector  matrix  to  Q.  Multiplying  and 
dividing  the  above  equation  by  det(R)  yields 

det(/g)det(/-(2)/det(/?)  det(j?)det(/-Q)det(jr^) 

"  det(/?)det(/+  Q)/detiR)  det(/?)det(/-!-  (2)det(/?-i ) 

A  dct{R(I-Q)Rr^)  _dct{l-RQR-^) 

^  det(/?(/+0/r‘)  dct{I+RQR-^) 


where  the  RQIT^  term  is  a  diagonal  matrix  containing  the  eigenvalues  of  the  Q  matrix.  Since 
the  determinant  of  a  matrix  is  the  product  of  all  the  eigenvalues,  the  above  can  be  written  as 


=  + 1  q.e.d 


where  p  is  the  number  of  nonzero  (imaginary)  eigenvalues  of  Q.  The  above  statement  proves 
that  all  C  matrices  formed  with  equation  (29a)  are  indeed  proper  matrices.  For  the  3x3  case,  let 
the  Q  matrix  be  defined  as  the  following  skew-symmetric  matrix: 


Q  =  [q]  = 


-  0  -^3 
93  0 

--92  91 


92  ■ 
-91 

0  J 


(30) 


After  substituting  equation  (30)  into  (29a),  it  can  be  verified  that  resulting  C  matrix  is  indeed 
equal  to  equation  (22).  Cayley’s  transformation  (29)  is  a  generalization  of  the  classical  Rodrigues 
parameter  representation  for  NxN  proper  orthogonal  matrices  [1,2],  while  the  Q  matrix  gener¬ 
alizes  the  Gibbs  vector  in  higher  dimensions  [2,10]. 

Using  the  [y]  matrix  defined  in  equation  (14)  the  Q  matrix  can  be  expressed  as  follows  [2]: 

0  =  (31) 

The  above  transformation  can  be  verified  by  performing  a  matrix  power  series  expansion  of 
equation  (31)  and  substituting  it  into  a  matrix  power  series  expansion  of  equation  (29a).  The  re¬ 
sult  is  a  matrix  power  series  expansion  for  the  matrix  exponential  function  as  expected  from  equa¬ 
tion  (11).  However,  equation  (12)  cannot  be  used  to  calculate  the  matrix  exponentials,  since  this 
equations  only  holds  for  the  3x3  case.  Note  the  similarity  between  equation  (31)  and  (20).  Both 


calculate  the  Rodrigues  parameters  in  terms  of  half  the  principal  rotation  angle! 

The  differential  kinematic  equations  of  the  C  matrix  were  shown  in  equation  (1),  where  the 
skew-symmetric  matrix  [®]  is  related  to  Qand  Q  via  the  kinematic  relationship  [1] 

[&]  =  2(/+  (2r‘  (2(/-  QT^ 

or  conversely,  Q  can  be  written  as 

j^  =  ^(/+!2)[a>](/-G)  (33) 

The  equations  (32-33)  are  proven  to  hold  for  the  higher  dimensional  case  in  reference  1.  For 
NxN  orthogonal  matrices,  [©]  =  -  [co]^  represents  an  analogous  “angular  velocity’  matrix. 

Higher  Dimensiondl  Modified  Rodrigues  Pcrameters 

As  is  evident  above,  the  modified  Rodrigues  parameters  have  twice  the  principal  rotation 
range  as  the  classical  Rodrigues  parameters.  It  can  be  shown  that  the  higher  dimensional  mod¬ 
ified  Rodrigues  parameters  also  have  twice  the  nonsingular  domain  as  the  higher  dimensional 
classical  Rodrigues  parameters. 

To  find  a  transformation  from  the  NxN  proper  orthogonal  matrix  C  to  the  modified  Rodrigues 
parameters,  let  us  first  examine  what  happens  when  taking  the  matrix  square  root  of  C.  Let  the 
square  root  matrix  W  be  defined  by  the  necessary,  but  not  sufficient  condition 

WW=C  (34) 

Obviously,  for  the  general  NxN  case,  there  will  be  many  W  matrices  that  satisfy  equation  (34). 
Using  the  spectral  decomposition  of  C  given  in  equation  (4),  the  spectral  decomposition  of  W  can 
be  written  as 

W=  Uy/Xu*  (35) 

Since  the  C  matrix  is  orthogonal,  all  the  eigenvalues  in  A  must  have  unit  magnitude.  Keep  in 
mind  that  the  A  matrix  in  equation  (35)  is  diagonal  and  that  the  matrix  square  root  is  trivial  to  cal¬ 
culate.  Since  taking  the  square  root  of  an  eigenvalue  with  unit  magnitude  results  in  another  ex¬ 
pression  with  unit  magnitude,  the  W  matrix  itself  is  unitary,  or  orthogonal  if  all  entries  are  real.  It 
turns  out  that  W  is  always  real  and  orthogonal,  as  long  as  no  eigenvalue  of  C  is  -1.  If  an  eigen¬ 
value  of  C  is  -1,  then  W  has  complex  values  and  is  a  unitary  matrix.  The  product  of  all  eigenval¬ 
ues  of  C  is  the  determinant  of  C  and  must  be  +1  since  C  is  proper.  For  even  dimensions  of  C,  the 
eigenvalues  must  all  be  complex  conjugate  pairs  for  the  det(C)  to  be  +1.  For  odd  dimensions,  the 


extra  eigenvalue  must  be  real  and  +1  in  order  for  the  matrix  to  be  proper. 

Each  time  a  square  root  is  calculated,  there  are  two  possible  solutions.  If  the  eigenvalue  in 
question  is  one  of  the  complex  conjugate  pairs,  then  the  sign  does  not  matter  for  W  to  be  a  proper 
matrix.  If  the  matrix  dimension  is  odd,  then  the  root  of  the  extra  eigenvalue  must  be  +1  for  W  to 
be  proper.  In  the  3x3  case  there  is  only  one  complex  conjugate  pair  of  eigenvalues.  Hence  only 
two  W  matrices  satisfy  the  above  conditions.  This  is  to  be  expected,  since  any  three-dimensional 
rotation  can  be  described  by  two  principal  rotation  angles  which  differ  by  2n,  one  of  which  is  pos¬ 
itive  and  the  other  is  negative.  To  make  the  choice  of  W  unique,  let  us  select  all  the  roots  of  the 
complex  conjugate  pairs  to  have  a  positive  real  part. 

Since  the  W  matrix  is  orthogonal,  with  one  exception,  it  has  a  principal  line  and  angle  asso¬ 
ciated  with  it.  If  the  C  matrix  had  an  eigenvalue  of  -1,  the  same  numerical  problems  arise  as  we 
encountered  with  finding  the  principal  rotation  vector.  Multiplying  W  with  itself  in  equation  (34) 
simply  doubles  the  principal  angle,  but  leaves  the  principal  line  unchanged.  Therefore  W  repre¬ 
sents  a  rotation  about  the  same  principal  line  as  C,  but  with  half  the  principal  angle.  This  pro¬ 
vides  conceptually  elegant  interpretations  of  the  square  root  of  C  as  defined  above.. 

For  three-dimensional  rotations,  the  simple  restriction  on  the  square  roots  of  the  eigenvalues 
can  be  shown  to  restrict  the  principal  rotation  angle  to  satisfy  — 180°  <  (jx  -i- 180  .  This  choice  is 
consistent  with  many  numerical  matrix  manipulation  packages  and  their  computation  of  a  square 
root  of  a  matrix.  Let  the  j-th  complex  conjugate  eigenvalue  of  C  be  denoted  as  ,  where  the 
the  phase  is  -  180°  <  0y  <  +  180°.  If  the  dimension  N  is  an  odd  number,  W  has  the  stracture 
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If  the  dimension  N  is  even,  then  W  is 


W=U- 
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(36) 


(37) 


Using  the  parameterization  given  in  equation  (11),  the  matrix  W  can  also  be  written  directly  in 


terms  of  the  principal  rotation  matrix  [y]  as  follows 

(38) 

This  solution  for  W  can  be  verified  by  substituting  it  into  equation  (34).  Comparing  equation 
(38)  with  equation  (11)  it  becomes  obvious  that  the  W  matrix  has  indeed  the  same  pnnciple  rota¬ 
tion  direction  as  C,  with  half  the  principle  angle.  Since,  for  three-dimensional  rotations,  there  are 
two  possible  principal  angles  for  a  given  attitude,  there  are  two  possible  solutions  for  equation 
(38).  Again,  by  keeping  |<|>|  <  180°,  the  same  W  matrix  is  obtained  as  with  the  matrix  square  root 

method  discussed  above. 

Remember  that  the  modified  Rodrigues  parameters  have  a  nonsingular  range  corresponding 
to  |(til  <  360°.  Since  W  is  the  direction  cosine  matrix  corresponding  to  half  of  the  principal  rota¬ 
tion  angle  of  C,  the  resulting  nonsingular  range  of  the  W  matrix  has  been  reduced  to  \^\  <  180  . 
This  is  the  same  nonsingular  range  as  the  classical  Rodrigues  parameters.  Therefore  the  Cayley 
transformations,  defined  in  equations  (29a,b),  can  be  applied  to  W.  Let  S  be  the  skew-symmetric 
matrix  composed  of  the  modified  Rodrigues  parameters,  similar  to  the  construction  of  the  Q  ma¬ 
trix  in  equation  (30).  Then  the  transformation  from  W  to  5  and  its  inverse  are  given  as; 

W=iI-SKI+S)~^  =  (7+5)"^  (7-5)  (39a) 

S  =  (7-  WKI+  Wr^  =  (7+  W)-i  (7-  W)  (39b) 

Using  equation  (39a)  and  (34),  a  direct  transformation  from  5  to  C  is  found. 

C  =  (7-S)^(7-l-S)"^  =  (7+5)"^(7-5)^  (40) 

This  direct  transformation  is  very  similar  to  the  classical  Cayley  transform,  but  no  elegant  di¬ 
rect  inverse  exists  (i.e.  we  lose  the  elegance  of  equation  (29b);  no  analogous  equation  can  be  writ¬ 
ten  for  S  as  a  function  of  C).  This  is  due  to  the  overlapping  principal  rotation  angle  range  of 
±360°  causing  the  transformation  in  equation  (40)  not  to  be  injective  (one-to-one).  Since  the  clas¬ 
sical  Rodrigues  parameters  are  for  principal  rotations  between  (-180°,+!  80°),  they  have  a  unique 
representation  and  the  Cayley  transform  has  the  well  known  elegant  inverse. 

However,  an  alternate  way  to  obtain  the  S  matrix  from  the  C  matrix  is  available  through  the 
skew-symmetric  matrix  [y]  defined  in  equation  (14). 

S  =  -  tanh(^|^j  =  -  (41) 

The  transformations  given  in  equation  (41)  can  be  verified  by  performing  a  matrix  power  se- 


ries  expansion  and  back-substituting  it  into  equation  (40).  Note  again  the  similarity  between  equa¬ 
tion  (41)  and  equation  (25).  The  principal  rotation  angle  is  divided  by  four  in  both  cases. 

Either  the  W  or  the  [y]  matrix  can  be  solved  from  the  proper  NxN  orthogonal  C  matrix  to  ob- 
tain  the  corresponding  S  matrix.  Neither  method  is  as  elegant,  however,  as  equation  (29b)  of  the 
Cayley  transformation.  The  method  using  the  [y]  matrix  has  the  advantage  that  [y]  is  found  by 
taking  the  matrix  logarithm  of  the  eigenvalues  of  the  C  matrix  as  shown  in  equation  (14),  The 
uniqueness  questions  do  not  arise  here  as  in  the  matrix  square  root  method  because  solutions  are 
implicitly  restricted  to  proper  rotations  with  l(j>I  <180°.  Both  methods  produce  the  same  results 
using,  for  example,  the  matrix  exponential  and  matrix  square  root  algorithms  available  as  MAT- 
LAB  or  MATHEMATICA  operators.  Note  that  both  the  classical  and  the  “updated"  Cayley  trans¬ 
form  have  numerical  problems  when  transforming  a  proper  orthogonal  matrix  C  into  a 
skew-symmetric  matrix  if  C  has  eigenvalues  of  -1. 

Since  each  set  of  modified  Rodrigues  parameters  has  its  associated  “shadow”  set  [6],  it  is  usu¬ 
ally  not  important  which  S  parameterization  one  obtains,  as  long  as  at  least  one  valid  S  matrix  is 
found.  Once  a  parameter  set  is  found,  either  the  original  ones  or  the  “shadow”  set,  it  is  trivial  to 
remain  with  this  set  during  the  forward  integration  of  the  differential  equations  governing  the  evo¬ 
lution  of  S. 

The  differential  kinematic  equations  for  S  are  not  written  directly  from  C  as  they  were  with 
the  classical  Cayley  transform.  Instead  W  is  used  to  describe  the  kinematics  of  the  NxN  system. 
The  relationship  between  IV  and  S  is  the  same  as  between  C  and  Q.  Therefore  the  same  equations 
can  be  used.  The  differential  kinematic  equation  for  IV  is: 

lV  =  -p]lV  (42) 

where  the  skew-symmetric  matrix  [m]  is: 

[Ci]=2(I+S)-H{I-S)-^  (43) 

or  conversely  could  be  defined  as: 

5  =  i(/+5)[Q](/-5)  (44) 

Equation  (34)  can  be  used  during  the  forward  integration  to  obtain  C(t).  The  time  evolution  of 
C  in  terms  of  IV  and  [Q]  is: 

c = -  [Q]iviv-  w[Q]iv=  -  [Q]c-  iv[a]iv 

Equating  equation  (45)  and  (1),  the  direct  transformation  from  [Q]  to  [m]  is: 


(45) 


(46) 


[(b]  =  [Q]  +  W[Cl]W'^ 

To  verify  that  equation  (46)  yields  a  skew-symmetric  matrix  [m]  ,  the  definition  of  a 
skew-symmetric  matrix  is  used: 

[a]=-(af =-([S]  +  vv[fi]H'''f 

[a>]  =  [a]  +  W[Q]W^  q.e.d. 

Although  this  new  parameterization  is  somewhat  more  complicated  than  the  classical  parame¬ 
terization  into  M-dimensional  Rodrigues  parameters,  the  complications  arise  only  when  setting  up 
the  parameterization  in  terms  of  5.  Once  an  5  matrix  and  a  corresponding  W  matrix  have  been 
found,  this  method  is  no  different  from  the  classical  method.  The  important  improvement  is  that 
the  range  of  possible  principle  rotations  has  been  doubled  over  the  classical  M-dimensional  Ro¬ 
drigues  parameters. 


A  Preliminary  Investigation  of  Higher  Dimensional  Euler  Parameters 

The  classical  Euler  parameters  stood  apart  from  the  other  parameterizations,  because  they 
were  bounded,  universally  nonsingular  and  had  an  easy-to-solve  bi-linear  differential  kinematic 
equations.  All  of  these  attractive  features  were  only  slightly  affected  by  the  cost  of  increasing  the 
dimension  of  the  parameter  vector  by  one.  These  classical  Euler  parameters  are  extended  below 
to  higher  dimensions,  where  they  will  retain  some,  but  not  all,  of  the  above  desirable  features. 

The  Rodrigues  parameters  and  the  Euler  parameters  are  very  closely  related  as  seen  in  equa¬ 
tion  (19).  They  are  identical  except  for  the  scaling  term  of  Pq.  The  classical  Rodrigues  parame¬ 
ters  have  been  shown  to  expand  to  the  higher  dimensional  case  where  they  parameterize  a  NxN  or¬ 
thogonal  matrix  C  [1].  Analogous  to  equation  (19),  they  can  always  be  described  as  the  ratio  of  a 
once-redundant  set  of  parameters. 


9/  = 


k 

Po 


i  =  1,2,3,...,  M  = 


NiN-l) 


(47) 


The  skew-symmetric  matrix  Q  in  equation  (29a)  can  be  written  as: 


(48) 


where  B  is  a  NxN  skew-synunetric  matrix  containing  the  numerators  P/  of  Q.  For  the  three 
dimensional  case,  this  matrix  is  the  “vector"  part  of  the  classical  Euler  parameters  Pj,  P2,  P3.  ai^d 


has  the  familiar  structure 


f  0  -p3  P2  ■ 
B=  P3  0  -Pi 

-~P2  Pi  ®  - 


(49) 


Substituting  the  transformation  relating  Q  to  {Pq.Pi.-.Pj^I.  as  given  in  equation  (48)  the  Cay¬ 
ley  transform  of  equation  (29a)  results  in  the  following 

C=(po/-B)(Po/+B)"^ 

C(po/+B)  =  (Pof-B) 

(/-C)po-(/+C)B  =  0  (50) 


Equation  (50)  represents  an  NxN  system  of  linear  equations  in  {Pq’Pi’  -’Pm^' 
[N\(M+1)]  matrix  A  represent  the  linear  relationship  between  the  Pj. 


A- 


rPoi 

Pi 


=  0 


LPmj 


(51) 


Clearly  the  set  of  all  possible  higher  dimensional  Euler  parameters  spans  the  kernel  of  A.  We 
know  that  the  M  Rodrigues  parameters  are  a  minimal  set  to  parameterize  the  orthogonal  NxN  ma¬ 
trix  C.  By  adding  the  scaling  factor  Pq,  a  once  redundant  set  of  parameters  has  been  generated. 
Even  though  there  are  N^  linear  equations  in  equation  (50),  the  dimension  of  the  range  of  A  is 
only  M.  The  problem  is  still  under  determined.  The  dimension  of  the  kernel  of  A  must  be  one, 
since  only  one  additional  term  was  added  to  a  minimal  set  of  rotation  parameters.  The  solution 
space  is  a  multi-dimensional  line  through  the  origin. 


Fig.  3:  Solution  of  the  Higher  Dimensional  Euler  Parameters. 
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After  finding  the  kernel  base  vector,  an  infinite  number  of  solutions  still  exist.  Another  con¬ 
straint  is  needed.  Let  us  set  the  norm  of  the  higher  dimensional  Euler  parameter  vector  to  be 
unity.  This  concept  is  illustrated  in  Fig.  3  above. 

P5+P?  +  -+Pm  =  i 

Equation  (52)  is  the  higher  dimensional  equivalent  of  the  holonomic  constraint  of  the  classi¬ 
cal  Euler  parameters  introduced  in  equation  (16). 

Two  solutions  are  found  scaling  the  base  vector  of  the  kernel  of  A  to  unit  length.  Just  as  with 
the  classical  Euler  parameters,  any  point  on  the  multi-dimensional  Euler  parameter  unit  sphere  de¬ 
scribes  the  same  physical  orientation  as  its  antipodal  pole.  Therefore  the  higher  order  Euler  pa¬ 
rameters  are  not  unique,  but  contain  a  duality.  This  is  exactly  analogous  to  the  classical  case. 
This  duality  does  not  pose  any  practical  problems,  except  under  one  circumstance  discussed 

below. 

C=  (po/-i5)(po/+5)"^  =(po/+5)"‘(pof-5)  (53) 

The  inverse  transformation  from  higher  order  Euler  parameters  to  the  orthogonal  matrix  C  is 
found  by  using  Q  from  equation  (48)  in  the  classical  Cayley  transform.  The  result  is  shown  in 
equation  (53).  Using  a  B.  as  shown  in  equation  (49)  for  the  three-dimensional  case,  in  equation 
(53)  results  in  the  same  transformation  as  given  in  equation  (18).  Observe  that  the  inverse  trans¬ 
formation  has  a  singularity  when  Po  is  zero.  This  singularity  is  a  mathematical  singularity  only. 
Contrary  to  the  Rodrigues  parameters,  the  higher  order  Euler  parameters  are  well  defined  at  this 
orientation.  After  an  appropriate  skew-symmetric  matrix  B  is  constructed  and  carrying  out  the  al¬ 
gebra  in  equation  (53),  a  closed  form  algebraic  transformation  is  found 

For  the  2x2  case,  the  B  matrix  is  given  by 


B 


ro  -Pii 

iPi  0  J 


(54) 


Using  the  B  defined  above  in  equation  (53),  the  2x2  direction  cosine  matrix  C  is: 


CtxI  = 


pQ-Pi  2PoPo 

-2PoPo  Po~Pi 


(55) 


The  2x2  C  matrix  contains  no  polynomial  fractions  and  is  easy  to  calculate.  To  find  the  direc¬ 
tion  cosine  matrix  for  the  3x3  case,  use  the  B  matrix  defined  in  equation  (5 1)  in  equation  (53). 
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Cij3 


1 

"MP^P?+P1+P3) 


■Po(Pg  +  P?-^-P3) 
2po(Plp2-PoP3) 

.  2Po(PiP3 +P0P2) 


2Po(PiP2  +  P0P3) 
Po(P§-P?+P2-P3) 
2Po(p2p3-PoPl) 


2po(PlP3-Pop2)  ■ 
2P0(P2P3+P0P1) 

Po(P§-P?-Pl+Pi)J 


After  making  the  obvious  cancellations  and  enforcing  the  holonomic  constraint  equation,  the 
well  known  result  is  found  which  represents  the  3x3  direction  cosine  matrix  as  a  function  of  the 
classical  Euler  parameters  as  given  in  equation  (18).  This  classical  representation  contains  no 
polynomial  fractions  and  no  singularities,  just  as  was  the  case  with  the  2x2  system. 

For  dimensions  greater  than  3x3’s,  however,  the  algebraic  transformation  contains  polynomial 
fractions.  The  nice  cancelations  that  occur  with  a  2x2  and  a  3x3  orthogonal  matrices  do  not  occur 
with  the  higher  dimensions.  This  might  have  been  anticipated,  because  [2]  it  is  well-known  that 
quaternion  algebra  does  not  generalize  fully  to  arbitrary  higher-dimensional  spaces,  and  the  ele 
gant  classical  Euler  parameter  results  are  essentially  manifestations  of  quaternion  algebra.  To 
find  C4x4  in  terms  of  the  higher  dimensional  Euler  parameters,  we  define  the  4x4  B  matrix  as: 


0  -^6  Ps  “P4' 

P6  0  -p3  P2 

-P5  P3  0  -Pi 

P4  -p2  Pi  0  J 


(56) 


and  substitute  it  into  equation  (53),  this  leads  to 


’Po(Po  ■1‘Pl  ■’■Pz  +  Pi  “P4  “Ps  “PD  2Po(Po(P2p4  +  P3P5  +PoP6)  +  Pl5) 

2Po(Po(p2p4  +  p3p5-PoP6)-Pl5)  Pi(Pi  +  Pi-Pl-Pi  +  p4+p5-P6)-S"  ... 

2P0(P0(P0P5  +  P3P6  -  Pi P4)  -  P25)  2po(Po(Pl  P2  -  P0P3  +  PsPs)  “  p45) 

.  2po(Po(-Pop4-PlP5-p2p6)-p35)  2P0(P0(P1P3  +  P0P2-P4P6)-P58) 

2po(Po(“  PoP5  PsPs  “  Pi  P4)  +  P28)  2Po(Po(PoP4  ~  Pi  Ps  “  PzPe)  +  P38) 

2p0(p0(plp2  +  p0p3+psp6)  +  p45)  2po(po(plp3-p0p2-p4p6)  +  ps5) 

Po(Po“Pl‘l'P2“p3‘''p4“P5‘‘'P6)“^^  2po(Po(PoPl  +p4pS +P2P3)  +  P65) 

2po(Po(-  PoPi  +  P4ps  +  P2P3)  -  PsS)  p§(p§  -  Pi  -  Pi  +  Pi  -  P4  +  Ps  +  Pi)  -5"' J 


with  5  =  P3P4 +  Pip6  “PzPs 

A  =  P§+52 


This  denominator  A  can  vanish  for  several  P-  configurations.  Observe,  however,  that  when¬ 
ever  A  is  zero,  so  is  the  numerator.  For  each  singular  case  we  can  confirm  that  a  finite  limit  ex¬ 
ists,  as  v/as  to  be  expected,  since  the  original  orthogonal  C  matrix  was  finite.  In  all  cases  Po  =  0 
is  a  prerequisite  for  a  (0/0)  condition  to  occur.  Finding  the  transformations  for  matrices  with  di¬ 
mensions  greater  than  4x4  would  show  the  same  behavior.  Po  =  0  is  always  a  indicator  that  a 
mathematical  singularity  may  occur.  In  none  of  these  cases  are  the  higher  dimensional  Euler  pa¬ 
rameters  themselves  actually  singular.  It  is  always  a  mathematical  singularity  of  the  transforma- 
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tion  itself.  To  circumvent  this  problem  for  particular  applications,  the  limit  of  the  fraction  can  be 
found  as  Po  0.  After  substituting  Po  =  0  equation  (57),  for  example,  most  fractions  be¬ 
come  trivial  and  the  matrix  is  reduced  to 


('4x4  — 


r-1 

0 

0 

0 


0 

-1 

0 

0 


0  0-1 

0  0 

-1  0 

0-1-1 


=  -hx4 


(58) 


Substituting  Po  =  0  into  equation  (55)  yields  the  same  result.  Actually,  as  long  as  C  is  of  even 
dimension  the  matrix  will  be  -/  if  Po  =  0.  If  the  dimension  is  odd,  as  it  is  for  the  3x3  case,  the  C 
matrix  will  be  fully  populated.  With  this  observation  it  is  easy  to  circumvent  the  singular  situa¬ 
tions  if  the  dimension  is  even.  If  the  dimension  is  odd  a  numerical  limit  must  be  found.  In  either 
case  the  transformation  will  be  well  behaved  everywhere  except  the  po  =  0  surface.  The  fact  that 
the  0/0  condition  can  be  resolved  analytically  to  obtain  finite  limits  should  not  obscure  the  frus¬ 
trating  fact  that  these  0/0  conditions  would  pose  numerical  difficulties  in  general  numerical  algo¬ 
rithms. 

Let  us  examine  the  uniqueness  of  the  transformation  given  in  equation  (53).  Assuming  that 
the  transformation  is  not  unique,  two  possible  higher  dimensional  Euler  parameter  sets  Pand  ^ 
are  chosen,  these  parameterize  C  as 


Subtracting  one  equation  from  the  other  the  following  condition  is  obtained: 

o=(M-^)(w+sr‘-(w+fir‘(M-^) 

0  =  (Po/+ ^)  -  (fc'-  ^)(Po' + ^) 


o=M-M 


or 


Po  ^0 


(59) 


Equation  (59)  is  the  necessary  condition  for  two  higher  order  Euler  parameter  sets  to  yield  the 
same  direction  cosine  matrix  C.  Obviously,  for  Po  0  this  can  only  occur  when 
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B  =  kB 

Po=*'Po 


(60) 


where  it  is  a  scalar.  This  condition  apparently  yields  an  infinite  number  of  solutions.  But 
since  the  higher  dimensional  Euler  parameters  must  satisfy  the  holonomic  constraint  given  in 
equation  (52),  only  unit  scaling  values  of  k  are  permissible.  Therefore  k  must  be  either  ±1.  The 
above  uniqueness  study  results  in  exactly  the  same  duality  as  is  observed  with  the  classical  Euler 
parameters,  except  the  restriction  on  Po  0.  There  are  always  two  possible  sets  of  classical  Euler 
parameters  which  describe  an  orthogonal  3x3  matrix  C.  It  is  evident  that  this  truth  extends  to  the 
more  general  case  of  NxN  orthogonal  matrices  .  This  duality  was  seen  earlier  when  applying  the 
holonomic  constraint  to  the  kernel  of  A. 


CiVxA/[P(0]  =  C'N.tv[-P(0] 


(61) 


Based  on  the  above,  if  po  =  0  nothing  can  be  said  about  the  transformation  uniqueness.  As 
was  seen  with  the  4x4  C  matrix,  the  Po  =  0  condition  permits  any  point  on  the  unit  sphere 

Having  established  the  forward  and  backward  transformations  between  the  NxN  orthogonal 
matrices  and  the  higher  order  Euler  parameters,  their  kinematic  equations  are  also  of  interest.  To 
describe  the  orthogonal  matrix  C  as  a  generalized  rigid  body  rotation,  C  must  satisfy  a  differential 
equation  of  the  form  given  in  equation  (1).  After  substituting  equation  (48)  into  equation  (33),  Q 


IS 


(62) 


After  differentiating  equation  (48)  directly,  Q  is  found  to  be 

A  ppg-M  ’ 
ft2 


(63) 


Upon  substituting  equation  (62)  into  equation  (63)  and  after  making  some  simplifications,  the 
following  kinematic  relationship  is  found. 


m-m=|(Po^+^)[®kpo/-5) 


(64) 


This  equation  can  be  solved  for  the  skew-symmetric  angular  velocity  matrix  [m] . 

[to]  =  2(po/+  B)-^  (PoB  -  PoB)(Po/-  B)'*  (65) 

Note  that  this  equation  contains  the  same  mathematical  singularity  at  Po  =0  as  did  equation 
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(53).  Carrying  out  the  algebra  a  closed  form  algebraic  equation  is  found  for  the  higher  order  an¬ 
gular  velocities. 

Let  us  verify  that  equation  (65)  for  the  angular  velocities  does  indeed  generate  a 
skew-symmetric  matrix.  This  is  easily  accomplished  using  the  definition  of  a  skew-symmetric 
matrix  as  follows 

[m]  =  — =-2((Po^+^)  ) 

[ffl]  =  -  2(Po/- 

(ml  =  -2(Po/^  +  «’■)'' 


Since  the  matrix  B  and  its  derivative  are  skew-symmetric  matrices  by  definition,  further  sim¬ 
plifications  are  possible  to  obtain  the  following  result 

[©]  =  -  2(po/+ (-  Po^  +  Po5)  (Po/- 

[m]  =2(po/+5)"‘(Po5-M)(Po^-^r^ 

All  higher  order  Euler  parameter  differentials  must  abide  by  the  derivative  of  the  constraint 
equation  (52). 

2$oPo+2fiPl+-+2|3MP«=0 


After  using  the  B  from  equation  (49)  the  linear  differential  kinematic  equations  of  the  classi¬ 
cal  Euler  parameters  are  found.  To  verify  that  equation  (65)  generalizes  correctly,  known  classi¬ 
cal  results  let  us  verify  two  special  cases.  For  the  2x2  case,  a  scalar  differential  kinematic  equa¬ 
tion  results  from  equation  (65)  as 


0)1  =2[-pi 


Po] 


Po 

Pi 


(67) 


Adding  the  constraint  in  equation  (66),  equation  (67)  can  be  padded  to  make  it  full  rank. 


po  Pr 

fPol 

-pi  Po. 

ipll 

(68) 


Note  that  as  with  the  3x3  case,  the  matrix  transforming  p  to  o)  is  orthogonal  for  the  2x2  case. 
Therefore  the  inverse  transformation  can  be  written  as: 


It  is  straight  forward  to  show  that  equations  (65)  and  (66)  give  equation  (17)  for  the  3x3  case. 
Analogous  to  the  3x3  case,  the  above  differential  kinematic  equation  for  the  2x2  case  is  also 
bi-linear.  As  with  the  4x4  and  greater  direction  cosine  matrices,  for  proper  orthogonal  matrices 
having  dimensions  greater  than  3x3  the  higher  dimensional  differential  kinematic  equations  also 
contain  polynomial  fractions.  Using  the  B  matrix  from  equation  (56)  in  equation  (65)  and  collect¬ 
ing  all  the  angular  velocity  term,  we  find  the  differential  kinematic  equations  for  the  4x4  case 


fO  1 
(0, 

©2 

<  ©3  • 

©4 

©5 

>.©6-> 


APo  AP,  Ap2 

P6(P2P5-P3P4)-P1(P0  +  PD  Po(Po  +  p6)  Po(Pop3-p5P6) 

P5(PlP6  +  P3P4)-P2(Po  +  p5)  -Po(PoP3 +P5p6)  Po(P2  +  P5) 
P4(P2P5-Plp6)-P3(Po  +  P4)  Po(p4P6  +  Pop2)  “  Po(PoPl  +  p4p5) 
P3(P2P5-P1P6)-P4(P5  +  PD  Po(-PoPs  +  p3p6)  "  Po (Pop6  +  Psp3 ) 
p2(PlP6+p3P4)-P5(P^0  +  Pl)  Po(Pop4-p2P6)  Po(P3p4  +  Pl  p6) 
-P1(P2P5-P3P4)-P6(PS  +  P?)  P0(P2P5-P3P4)  Po(Pop4  "  Pi  Ps) 


Apj  AP4  APj  APg 

Po(p4p6-Pop2)  Po(Pop5  +  p3p6)  “  Po(Pop4  +  p2p6)  Po(p2p5  "  P3P4) 

Po(PoPl  -  P4p5)  Po(PoP6  -  P5P3)  Po(P3p4  +  Pi  p6)  "  Po(Pop4  +  PlPs) 
•  Po(PS  +  PD  P0(P2P5-P1P6)  P0(P0P6-P2P4)  Po(Pl  p4  -  PoPs) 

Po(P2p5-Plp6)  Po(P5  +  Pi)  Po(PoPl-p2P3)  Po(Pop2  +  Pl  p3 ) 
-Po(Pop6  +  P2p4)  -Po(PoPl+p2p3)  Po(PS  +  Pl)  Po(Pop3  "  Pl p2 ) 

Po(Plp4  +  Pop5)  Po(-PoP2+Plp3)  -Po(PoP3  +  PlP2)  Po(P^  +  PD 


with  A  =  Po  +  (P3P4  -  P2P5  +  Pl  p6)^ 


[Pol 

P, 

P2 

P3 

P4 

P5 

IPoJ 


(70) 


Note  that  this  transformation  matrix  is  no  longer  orthogonal  as  were  the  corresponding  ma¬ 
trices  for  both  the  2x2  and  3x3  cases.  The  bi-linearity  found  for  2x2  and  3x3  cases  is  also  lost  for 
the  higher  dimensional  cases.  Equation  (70)  has  the  same  denominator  as  the  4x4  direction  co¬ 
sine  matrix.  Hence  it  contains  the  identical  singular  situations.  However,  if  Po  =  0 ,  the  above 
transformation  matrix  is  singular  and  cannot  be  inverted! 

Thus  the  higher  dimensional  Euler  parameters  lose  some  key  properties  as  they  are  general¬ 
ized  to  parameterize  higher  dimensioned  proper  orthogonal  matrices.  They  retain  the  properties 
of  being  bounded  and  mapping  all  rotations  onto  arcs  on  a  unit  hypersphere.  However,  the  kine¬ 
matic  transformations  and  orthogonal  matrix  representations  loose  the  elegance  of  their  classical 
3x3  counterparts.  In  particular,  Po  =  0  poses  several  unresolved  issues  for  all  dimensions  higher 
than  3x3. 


Conclusion 

The  principal  rotation  parameterizations  presented  show  great  promise  as  an  elegant  means 
for  describing  the  evolution  of  NxN  orthogonal  matrices.  The  modified  Rodrigues  parameters  are 


only  slightly  more  complicated  than  their  classical  counterparts,  but  double  the  nonsingular  rota¬ 
tion  domain  The  (M+l)-dimensional  Euler  parameters  retain  some  of  the  desirable  features  of 
their  classical  counterparts.  However,  for  orthogonal  matrices  greater  than  3x3  though,  the  or¬ 
thogonal  matrix  representation  formulas  and  the  corresponding  differential  kinematic  equations 
contain  some  mathematical  singularities  which  require  taking  the  limits  of  polynomial  fractions. 
The  computational  effort  for  calculating  the  higher  dimensional  Euler  parameters  grows  rapidly 
when  increasing  the  dimension  of  the  C  matrix.  For  higher  dimensional  rotations,  the  modified 
Rodrigues  parameters  show  the  greatest  promise.  The  gain  (increased  nonsingular  domain  in 
comparison  to  the  classical  Cayley  transformation),  significantly  outweighs  the  extra  computa¬ 
tion. 
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Intrnriiiction/Motivation 


Consider  the  Optimal  Control  Problem: 
Find  u(t)  such  that  the  solution  of 

x  =  f(t,x.u)  .x(to)  specified 


extremizes 


subject  to 


j  =  (l)+  jF{t,x,u)cit 


ti 


T(t,.x(t,))-0 


Two  Approaches  to  Solution  : 


.  Function  Space  Approach 
Take  Variation 

=>  Pontryagin’s  Principle  &  TPBVP 


.  Parameterize  u(t)  = 

Optimize  (WpW2,-**,w^) 

via  Nonlinear  Programming 


Consider  the  System  of 

x  =  f(t,x.u) 

with  radial  basis  function  approximation 


u  =  |;Wie 


2I  a. 


Then  the  system  becomes 

x  =  f(t,x,w) 

Let’s  consider  the  matrix  of  partial  derivative. 


which  satisfies 

I WI.  1.)!  -  [AWIfd  t.)] + t.)]  =  [0] 


where 


Thus,  the  original  system  can  be  represented 
by  augmented  system 

z  =  r(t,x,w) 

where 


The  solution  to  this  dynamical  system  ; 

Ay  =  A  Aw 


We  use  minimum  norm  correction  algorithm. 

Aw  =  A'^(a  A'^)''Ay 


where 


and 


A  Step  size  limitation  filter  according 
to  the  value  of  Aw  is  used  as  follows; 

^new  ^  ^old 


where 

jAw]  =  a/aw^Aw 


If  Aw  <  8  for  acceptably  small  e,  then 

Aw=  A'^(a  ^Ay 
else  if  Aw  >  e  for  acceptably  small  s,  then 

r* 

Aw  = 


Aw 


A‘'(aaT)"  Ay 


e 


4 
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Even  after  the  terminal  constraints  are  met 
we  generally  do  not  know  whether  how  near 
the  performance  is  to  optimal. 

To  drive  the  performance  value  toward  the 
optimal,  we  introduce  a  homotopy  concept. 


XS  +  {\-X) 


J 


current 


Since  the  homotopy  concept  is  used  to  treat 
the  performance  index  (J,,)  as  an  additional 


equality 


constraint,  we  modify  Ay  as  follows; 


For  adaptively  spaced  RBF  algorithm 
we  check  the  sensitivity  of  the  terminal 
constraints  and  the  performance  index 
w.r.t.  parameters  as  follows; 
we  form  the  augmented  Jacobian 


5 


# 


av, 

5v|/, 

d^2 

SWn 

S'Vi 

d\if2 

S'Vi 

d^2 

d'Vq 

•  • 

d\N^ 

d^2 

=  [Ai,A2,**-,An] 


The  A,  vector  is  the  gradient  of  the  constraint 
and  performance  index  w.r.t.  Wj. 

Adopting  the  positive  measure  of  the  sensitivity 
w.r.t.  ith  parameter  as 

Si  =  Ai^  Ai 

we  introduce  a  new  RBF  according  to  Sj. 

With  the  newly  added  RBF  we  increment  X 
to  obtain  a  new  and  follow  the  same 
procedure  until  a  small  lncrease(A?Lniin) 
cannot  be  achieved,  while  satisfying  all 
constraints  within  a  tolerance. 


6 
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START 


Initial  w,  x,  a 


x  =  f(t,x,w) 
dx  dw 


_a]LTMk)l=[_iL.lvf(t,),  Y 

5x(tf)  _  _  _5x(tf)_ 

^new  ^  ^old  ^  A~^(AA'*')~^AY 


x  =  f(t,x,w) 
dx  d\v 


dY  T54tf)l_[  Im)^  Y 

ax(tf)__  dw  J 


Vq 

'^o  “  '^current 


STOP 
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EXAMPLE 


Fig.  1  Maximum  Radius  Orbit  Transfer  in  a  Given 


The  differential  equations  of  the  system 

r  =  u,  r(0)  =  To 
•  \x  TsirKj) 


u  = 


+ 


• 

1 

o 

E 

mt 

,u(0)  =  Uo 


•  uv  Tcos(|) 
v  = - + - v(0)-  — 


m 


0 


m 


'0 


The  terminal  constraints  : 

V|/,  =  u(t,)  =  0 


H^.  =  v(td-^  =  0 


Evenly  Spax^ed  Radial  Basis  Function  Algorithm 


Fig.  2  A  and  r{tf)  vs.  time  for  Evenly  Spaced  R.B.F.  Algorithm 


phi  (radian) 


3  R.B.F.S 

4 
2 


0  50  100 

6  R.B.F.S 


4  R.B.F.S  5  R.B.F.S 


time  :  100  ~>  3.3067(non-dim.) 


Fig.  3  (j>  vs.  time  for  Evenly  Spaced  R.B.F.  Algorithm 

3  R.B.F.S  4  R.B.F.S  5  R.B.F.S 


time  :  100  ~>  3.3067(non-<iim.) 

Fig.  4  T{tf)  vs.  time  for  Evenlt  Spaced  R.B.F.  Algorithm 
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Lamda 


Adaptively  Spaced  Radial  Basis  Function  Algorithm 
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lamda  vs.  #  of  Adaptively  Spaced  R.B.F.S 
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Fig.  5  A  and  r{tf)  vs.  for  #  of  Adaptively  Spaced  R.B.F.s 


3  R,B.F.s 


4  R.B.F.S 


5  R.B.F.s 


time  :  100  ->  3.3067(non-clim.) 


Fig.  6  <p  vs.  time  for  Adaptively  Spaced  R.B.F.  Algorithm 

3  R.B.F.S  4  R.B.F.S  5  R.B.F.s 


time :  100  ->  3.3067(nonHjim.) 

Fig.  7  r{tf)  vs.  time  for  Adaptively  Spaced  R.B.F.  Algorithm 
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Lamda 


Comparison  of  Two  Algorithms 

lamda  vs.  Number  of  Parameters 
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Fig.  8  A  and  r{tf)  vs.  time  for  Two  Algorithm 
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rONrTJIDTNG  REMARKS 


•  Radial  Basis  Function  (RBF)  Methods  Investigated 
To  Parameterize  Function  Space  Optimal 
Control  Problem 


.  Two  Variations  Studied 

•  Evenly  Spaced  Centers 
.  Adaptive  Centers 


.  Minimum  Norm  Nonlinear  Programming  Algorithm 
Used  To  Iteratively  Adjust  RBF  Weights 


•  Applied  These  Ideas  to  Low  -  Thrust 

Interplanetary  Trajectory  Optimization  Problem 


•  Our  Algorithms  Have  Been  Fully  Validated  ! 
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Abstract 

In  this  paper  we  generalize  some  previous  results 
on  attitude  representations  using  Cayley  transforms. 
First,  we  show  that  proper  orthogonal  matrices,  that 
naturally  represent  rotations,  can  be  generated  by  a 
form  of  “conformal”  analytic  mappings  in  the  space 
of  matrices.  Using  a  natural  parallelism  between  the 
elements  of  the  complex  plane  and  the  real  matrices, 
we  generate  higher  order  Cayley  transforms  and  we 
discuss  some  of  their  properties.  These  higher  order 
Cayley  transforms  are  shown  to  parameterize  proper 
orthogonal  matrices  into  higher  order  ‘‘Rodrigues 
parameters. 

1.  Introduction 

The  question  of  the  proper  choice  of  coordinates  for 
describing  rotations  has  a  very  long  and  exciting  his¬ 
tory.  Starting  with  the  work  of  Euler  and  Hamilton 
a  series  of  different  parameterizations  were  intro¬ 
duced  by  several  researchers  during  the  past  hun¬ 
dred  years.  We  will  not  delve  into  these  results  here 
since  they  can  be  found  in  any  good  textbook  on 
attitude  representations^'^.  We  just  mention  the  re¬ 
cent  survey  article  by  Shuster^  in  the  special  issue 
in  Ref.  [4]. 

In  this  paper  we  take  a  slightly  more  abstract 
point  of  view  than  the  previous  references.  Our 
main  objective  is  to  “unify”  some  of  the  existing 
results  in  the  area  of  attitude  representations.  It 
is  hoped  that  this  global  view  will  add  to  the  cur¬ 
rent  understanding  of  attitude  representations.  Our 
motivation  stems  mainly  from  the  recent  results  on 
second  order  Rodrigues  parameters®'®'^.  In  partic¬ 
ular,  in  Ref.  [7]  it  was  shown  that  these  (Modified) 
Rodrigues  parameters  can  be  generated  by  a  second 

•Assistant  Professor,  Department  of  Mechanical, 
Aerospace  and  Nuclear  Engineering.  Member  AIAA. 

tEppright  Professor,  Department  of  Aerospace  Engineer¬ 
ing.  Fellow  AIAA. 

^  Graduate  Student,  Department  of  Aerospace  Engineer¬ 
ing.  Student  member  AIAA. 


order  Cayley  transform,  the  same  way  the  classical 
Cayley-Rodrigues  parameters  are  generated  by  the 
Cayley  transform®.  Viewing  the  Cayley  transform 
as  a  bilinear  transformation  which  maps  the  space 
of  skew-symmetric  matrices  onto  the  space  of  proper 
orthogonal  matrices  (and  vice  versa)  one  is  naturally 
led  to  the  notion  of  conformal  mappings  (a  gener¬ 
alization  of  the  bilinear  transformation)  from  the 
imaginary  axis  onto  the  unit  circle  (and  vice  versa). 
We  seek  to  generalize  these  conformal  mappings  to 
matrix  spaces.  Drawing  on  the  insightful  statements 
by  Halmos®  we  show  that  such  an  intuitive  gener¬ 
alization  is  indeed  possible.  We  are  therefore  able 
to  generate  the  Euler  parameters,  the  Rodrigues  pa¬ 
rameters  and  the  Modified  Rodrigues  parameters  as 
special  cases  of  such  conformal  mappings.  Higher  or¬ 
der  Rodrigues  parameters  can  be  easily  constructed 
using  this  approach,  although  their  relevance  to  ap¬ 
plications  is  still  to  be  determined.  We  explicitly 
develop  the  third  and  fourth  order  “Rodrigues  pa¬ 
rameters”  in  order  to  illustrate  potential  advantages 
as  well  as  difficulties.  The  question  of  kinematics  of 
these  higher  order  “Rodrigues  parameters”  is  much 
more  subtle  and  is  briefly  discussed  at  the  last  sec¬ 
tion  of  the  paper.  A  more  in-depth  discussion  of  the 
kinematics  is  left  for  future  investigation. 

The  first  part  of  the  paper  reviews  the  standard 
Cayley  transform  and  it  generalizes  this  transform 
to  higher  orders.  There  is  no  restriction  on  the  di¬ 
mension  of  the  matrices  involved,  i.e.,  the  results 
hold  for  n  X  n  matrices.  In  the  second  part  of  the 
paper  we  apply  these  results  to  the  case  of  interest 
to  attitude  dynamicists,  i.e.,  the  case  n  =  3. 

Some  notation  and  terminology  is  necessary  in 
order  to  keep  the  discussion  clear  and  terse.  We 
use  the  standard  mathematical  notation  50(n)  to 
denote  the  space  of  proper  orthogonal  matrices  of 
dimension  n  x  n.  Invertible  n  x  n  matrices  form  the 
space  Gl{n),  the  general  linear  group.  The  space 
of  orthogonal  matrices  is  denoted  by  0(n)  and  it  is 
the  set  of  all  (invertible)  matrices  A  €  Gl{n)  such 
that  A^A  =  AA'^  =  I.  Clearly,  if  A  €  0(n)  then 


# 
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det(A)  =  ±1-  The  qualifier  “proper”  then  refers 
to  those  orthogonal  matrices  with  positive  determi- 
nant,  that  is, 

SO{n)  =  {AeGl{n):AA'^  =  1,  det(>l)  =  +l} 

These  matrices  represent  rotations,  while  the  or¬ 
thogonal  matrices  with  determinant  -1  represent  refle¬ 
ctions^®.  The  space  50(n)  {as  well  as  Gl{n)  and 
0(n))  forms  a  group.  We  will  see  later  on  that 
one  can  define  a  differential  equation  for  elements 
of  SO(n).  The  solutions  of  this  differential  equa¬ 
tion  form  trajectories  (one-parameter  subgroup^  on 
SO(n)  and  this  differentiable  structure  makes  5C?(n) 
actually  a  Lie  group  (i.e.  a  group  with  a  differen¬ 
tiable  manifold  structure). 

The  space  of  n  x  n  skew-symmetric  matrices  will 

be  denoted  by  so{n)  That  is, 

so(n)  =  {A  e  IR”’'”  :A  =  -A'^} 

The  space  so(n)  is  actually  the  tangent  vector  space 
to  SO(n)  at  the  identity.  This  property  can  be  e^ily 
verified  by  differentiating  A  €  SO(n).  Since  AA  = 
I  one  has  that 

±{AA'^)  =  0^AA'^  =  -AA'^ 
at 

Evaluating  the  previous  expression  ai  A  =  I  one 
obtains  that 


and  so  ^41  is  skew  symmetric. 

U=/ 

2.  The  Cayley  Transform 

Cayley’s  transformation  parameterizes  a  proper  or¬ 
thogonal  matrix  C  as  a  function  of  a  skew-symmetric 
matrix  Q.  It  is,  therefore,  a  map 

^  :  so(n)  -+  SO{n)  (1) 

The  classical  Cayley  transform®  is  given  by 

C  =  i>{Q)  =  {l-Q){l  +  Qr^ 

=  {i+Qr\i-Q)  (2) 

Since  Q  is  skew-symmetric  all  its  eigenvalues  are 
pure  imaginary.  Thus,  all  the  eigenvalues  of  the  ma¬ 
trix  I+Q  axe  nonzero  and  the  inverse  in  Eq.  (2)  ex¬ 
ists.  The  Cayley  transform  k  therefore  well-defined 
for  all  skew-symmetric  matrices.  The  inverse  trans¬ 
formation  is  identical  and  is  given  by 

Q  =  rHc)  =  HG)  =  {i-c){i+cr^ 

=  {i+cr\i-c)  (3) 


The  inverse  transformation  is  not  defined  when  C 
has  an  eigenvalue  at  -1,  because  in  this  case  dei{I + 

C)  =  0.  Since  C  is  orthogonal,  all  its  eigenvalues  lie 
on  the  unit  circle 

Si  =  {(n ,  X2)  €  IR^  if  +  *2  =  1}  (4) 

Therefore  sp{C)  C  S^,  where  sp(-)  denotes  the  spec¬ 
trum  of  a  matrix,  and  the  transformation  (3)  re¬ 
quires  that  -1  sp(C).  The  same  result  is  also 
shown  in  Ref.  [7]. 

It  is  an  easy  exercise  to  show  that  C  is  orthog¬ 
onal  if  Q  is  skew-symmetric.  In  order  to  show  that 
the  transformation  (2)  produces  only  proper  orthog¬ 
onal  matrices,  let  us  examine  the  determinant  of  C. 
Using  Eq.  (2)  the  determinant  of  C  can  be  expressed 
as 

det{C)  =  det{I-Q)det{{I^Q)~^) 

detjl  -  Q)  /gN 

det{I  -b  Q) 

Since  all  the  eigenvalues  of  Q  are  imaginary  (sp(Q)  C 
O)  they  are  of  the  form  ±:A; .  The  spectral  decom¬ 
position  of  the  matrix  Q  then  yields 

Q  =  R-^AR 

where  A  =  di(ig{aXj).  (The  matrix  Q  is  normal 
and  normal  matrices  are  always  diagonalizable“.) 
Noting  that  I±Q  =  R-^{I±A)R  we  rewrite  Eq.  (5) 
as 

det{R-^)det{I  -  A)det{R)  _  def(f-A) 
-  det{R-^)det{I  +  A)det{R)  det(/  +  A) 

- "  m=i(i+^j) 

where  2p  is  the  number  of  nonzero  (imaginary)  eigen¬ 
values  of  Q.  Therefore  C  €  SO{n)  if  Q  €  so{n)  and 
thus,  the  Cayley  transformation  is  injective  (one- 
to-one)  and  surjective  (onto)  from  the  set  of  skew- 
symmetric  matrices  to  the  set  of  proper  orthogonal 
matrices  with  no  eigenvalue  at  —1. 

3.  Cayley  Transforms  as  Conformal 
Mappings 

The  three  most  important  subsets  of  the  complex 
numbers  are  the  real  numbers,  the  imaginary  num¬ 
bers,  and  the  numbers  with  absolute  value  one  (i.e., 
the  numbers  on  the  unit  circle).  Following  the  stan¬ 
dard  mathematical  language,  we  use  the  symbols  IR, 


2 


mo 


3  =  ilR  and  5^  to  denote  these  three  sets,  respec¬ 
tively.  Trivially,  these  sets  are  subsets  of  the  com¬ 
plex  plane,  denoted  by  <D.  There  is  a  very  elegant 
Lalog  between  these  three  subsets  of  the  complex 
plane  and  the  n  x  n  matrices®,  i.e.,  the  elements  of 
jp^nxn  analog  can  be  easily  understood  and 

appreciated  as  follows:  An  elementary  result  in  ma¬ 
trix  algebra  states  that  every  n  x  n  matrix  with  teal 
elements  can  be  decomposed  into  the  sum  of  a  sym¬ 
metric  and  a  skew-symmetric  matrix.  For  example, 
any  A  €  can  be  written  as 


A  =  - r - r 


(6) 


2  •  2 

It  is  easy  to  verify  that  the  first  inatrix  in  Eq.  (6)  is 
symmetric  and  the  second  matrix  is  skew-syrnmetric. 
Symmetric  matrices  always  have  real  eigenvalues  and 
skew-symmetric  matrices  have  always  imaginary  eigen¬ 
values.  Recall  now  that  a  complex  number  c^  al¬ 
ways  be  decomposed  into  the  sum  of  a  teal  an 
an  imaginary  part.  This  parallelism  between  com¬ 
plex  numbers  and  matrices  allows  one  to  treat  the 
symmetric  matrices  as  the  “real  numbers  and  the 
skew-symmetric  matrices  as  the  “imaginary  num¬ 
bers”  in  the  set  of  IR"’'’'  matrices  •  ^ 
recall  that  an  orthogonal  matrix  in  IR  has  all  its 
eigenvalues  on  the  unit  circle.  Drawing  the  previ¬ 
ous  parallelism  even  further  we  can  therefore  treat 
the  orthogonal  matrices  as  the  “elements  on  the  unit 
circle”  in  the  space  IR"’'".  Similar  statements  can 
be  made  for  the  case  of  n  x  n  matrices  with  cotn- 
plex  entries  (elements  of  C"’'"),  where  now  hermi- 
tian,  skew-hermitian  and  unitary  matrices  have  to 
be  used  instead  of  symmetric,  skew-symmetric  and 
orthogonal  matrices,  respectively.^ 

We  intend  to  use  this  heuristic  correspondence 
between  complex  numbers  and  n  x  n  matrices  in  or¬ 
der  to  motivate  and  generalize  the  Cayley  transform 
to  higher  order.  Before  we  proceed,  we  brieBy  review 
some  elements  from  complex  function  theory  •  . 
First,  recall  that  a  (complex)  function  is  analytic  in 
an  open  set  if  it  has  a  derivative  at  each  point  in 
that  set.  In  particular,  /  is  analytic  at  a  point  zq  if 
it  is  analytic  in  a  neighborhood  of  zq.  Moreover,  an¬ 
alytic  functions  have  (uniformly)  convergent  power 
series  expansions^®. 

Definition  3.1  A  transformation  w  =  f{z)  where 
tt;,  z  €  <C  is  said  to  be  conformal  at  a  point  zq  if  /  is 
analytic  there  and  /'(zq)  #  0. 

A  conformal  mapping  is  actually  conformal  at 
each  point  in  a  neighborhood  of  zo,  since  the  ana- 
lyticity  of  /  at  zq  implies  analyticity  in  a  neighbor¬ 
hood  of  Zq.  Moreover,  since  /'  is  continuous  at  zq,  it 


follows  that  there  is  also  a  neighborhood  of  zq  with 
/'(z)  #  0  for  all  z  in  this  neighborhood  .  It  is  a 
trivial  consequence  of  the  above  definition  that  the 
composition  of  conformal  mappings  is  also  a  confor¬ 
mal  mapping. 

A  significant  special  class  of  conformal  mappings 
in  the  complex  plane  is  the  class  of  linear  fractional 
transformations  (also  called  bilinear  transformations) 
defined  by 

^  =  iad-bc^O)  (7) 

cz  -h  a 

An  important  property  of  the  linear  fractional 
transformations  is  that  they  always  transform  cir¬ 
cles  and  lines  into  circles  and  lines^®.  In  this  pa¬ 
per  we  are  interested  -  in  particular  -  in  conformal 
transformations  of  the  form  (7)  which  map  the  unit 
circle  on  the  imaginary  axis  and  vice  versa.  One 
such  transformation  is  given  by  u;  =  /(z)  where 


(8) 


It  is  an  easy  exercise  to  show  that  if  z  6  S  then 
liyl  =  1,  that  is,  w  €  and  thus,  w  is  on  the 
unit  circle.  Conversely,  if  u;  €  then  the  inverse 
transformation  z  =  f~^iw)  given  by 

^  (9) 

1  +  u; 

implies  that  the  real  part  of  z  is  zero  and  thus,  z  G  Q. 

The  inverse  transformation  (9)  is  defined  every¬ 
where  except  at  ly  =  —1.  The  point  tu  =  — 1  is 
mapped  to  infinity  (see  Fig.  1).  In  fact,  the  map 
(8)  introduces  a  one-to-one  transformation  /  ;  O  — ► 


Figure  1:  Bilinear  transformation. 

Let  us  now  introduce  the  conformal  mapping  gn  • 
S'  — ♦  5'  defined  by 

gn{w)  =  w”,  n  =  2,3,...  (10) 

The  function  gn  is  a  mapping  from  the  unit  circle 
onto  the  unit  circle.  This  transformation  is  only 


# 


# 
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locally  iniective.  Therefore  the  inverse  of  Sn  esuts 
only  locally.  Given  x  =  «"  « 

equation 


^  s  ,  71  —  2|  3, .  •  • 


yields  that 

ti,  =  e*(^),  it  =  0.1.2 . n-1  (11) 

Equation  (11)  shows  that,  in  general,  the  equation 
Y  =  «;"  has  more  than  one  solution.  This  result  wil 
turn  out  to  be  beneficial  in  section  5  when  we  discuss 
the  application  of  higher  order  Cayley-transforms 
to  attitude  representations,  because  these  root® 
h,  nsed  to  avoid  the  inherent 

dimensional  parameterizations  of  SO(3).Fo 

in  Eq.  (H)  we  get  that  tu  =  c’*.  We  will  call  this 

the  principal  nth  root  of  x*  . 

The  composition  of  the  maps  /  and  gn  is  tne 
function  :  9  -  5'  defined  by  hn  =  ffn  o  /,  that 


hn(z 


(12) 


which  maps  the  imaginary  axis  onto  the  unit  circle. 
Similarly  to  (?„,  this  map  is  only  locally  i^ertible. 
A  local  inverse  is  obtained,  for  example,  by  set  ing 
jk  =  0  in  Eq.  (ll)i  in  which  case  we  have  that  (x  - 

e«) 


:  =  c‘* 


o  ,  A 

9  =  r.,ctan(_.— j 

and  where  bar  denotes  complex  conjugate. 

4.  Higher  Order  Cayley  Transforms 

One  of  the  most  celebrated  results  in  matrix  alge¬ 
bra  is  the  Cayley-Hamilton  theorem.  This  theorem 
states  that  a  matrix  satisfies  its  own  characteristic 
polynomial.  An  important  consequence  of  this  the¬ 
orem  is  that,  given  any  matrix  A  €  IR"’"’  and  an 
analytic  function  F{z)  inside  a  disk  of  radius  r  in 
the  complex  plane,  one  can  unambiguously  define 
the  matrix-valued  function  F{A)  if  the  eigenvalues 
of  A  lie  inside  the  disk  of  radius  r.  In  other  words, 
if  F  is  given  by 


n*) 1^1 

«=:0 


then 


i=o 


and  the  previous  series  converges  assuming  that  lAj  |  < 
r  where  Xj  €  sp{A)  for  j  =  l,2,...,n.  There¬ 
fore,  the  matrix  F{A)  is  well-defined.  Moreover 
the  eigenvalues  of  the  matrix  F{A)  are  F(Aj  )[j  — 
l,2,...,n)(Ref.[ll]).  . 

Consider  now  the  conformal  mapping  /  “Oni  bq. 
(8)  which  maps  the  imaginary  axis  on  the  unit  circle. 
This  function  is  analytic  everywhere.  According  to 
the  previous  discussion,  the  matrix 

+  (1^) 


is  well-defined  for  Q  €  so(n)  and,  actually,  C 
f(Q)  6  SO(n).  Comparison  between  the  previous 
equation  and  Eq.  (2)  reveals  that  the  Cayley  trans¬ 
form  can  be  viewed  as  a  special  case  of  a  conformal 
mapping  in  the  space  of  matrices. 

We  have  seen  that  there  is  a  natural  correspon¬ 
dence  between  9  and  so(n),  as  well  as  between  S 
and  SO(n).  (We  caution  the  the  mathematically  in¬ 
clined  reader  to  take  these  statements  in  the  context 
of  the  discussion  in  section  3.  We  do  not  claim  that 
this  correspondence  carries  any  inore  weight  than 
providing  one  qualitative  motivation  for  the  gener¬ 
alization  of  certain  complex  analytic  results  to  anal¬ 
ogous  results  in  the  space  of  matrices).  Following 
Eq.  (12)  we  can  also  define  a  series  of  transforma¬ 
tions  hn  '  so(n)  SO{n)  by 


hn{Q) ={1-  Qni + Qy" = ~  ^1^4) 

where  Q  is  a  skew-symmetric  matrix.  It  should  be 
clear  by  now  that  C  =  hn{Q)  is  a  proper  orthogonal 
matrix,  i.e.,  C  €  SO(n).  We  shall  refer  to  the  femily 
of  maps  h„(Q)  in  Eq.  (14)  as  Higher  Order  Cayley 
Transforms.  The  consequences  of  such  a  generaliza¬ 
tion  in  attitude  representations  will  become  appar¬ 
ent  in  the  next  section. 

For  now,  let  us  concentrate  on  the  inverse  map 
h~^  :  SO{n)  — ►  so(n).  Since  hn  =  9n<>f  on®  obtains 
_  y-i  o  g-'^.  The  function  /"^  is  given  by 
Eq.  (9)  which,  when  applied  to  a  proper  orthogonal 
matrix  Q  with  no  eigenvalue  at  -1,  gives  the  inverse 
of  the  classical  (or  first  order)  Cayley  transform  « 
in  Eq.  (3).  The  map  g-^  :  SO{n)  SO(n)  on  the 
other  hand  requires  the  nth  root  of  an  orthogonal 
matrix.  First,  we  show  that  is  well-defined  in 
the  sense  that  the  nth  root  of  a  (proper)  orthogonal 
matrix  with  no  eigenvalue  at  -1  is  also  a  (proper) 
orthogonal  matrix  with  no  eigenvalue  at  -1.  This 
will  also  prove  that  the  composition  of  maps  5;^ 
aad  is  well-defined  since  the  range  of  is  in 

the  domain  of . 

To  this  end,  consider  an  orthogonal  matrix  C  € 
SO(n)  such  that  A  9^  -1  for  all  A  €  sp{C).  The 


4 


onn 


matrix  C  can  be  decomposed  as  follows 

C  =  UQU*  (15) 


rotation  matrix  C  traces  a  curve  in  50(3)  such  that 
C{t)  €  50(3)  for  all  t  >  0.  The  differential  equation 
characterizing  this  trajectory  on  50(3)  is  given  by 


for  some  unitary  matrix  U  i  where 

0  =  blockdiag(Qi,Q2j  •  •  •  (15) 

if  n  is  odd  and 

0  =  6/ocfcdiaff(0i,02i---i®n)  (1^) 


if  n  is  even,  and 


0 


j  =  1, . . . ,  n 


(18) 


The  diagonal  elements  of  the  matrix©  in  Eq.  (15) 
are  the  eigenvalues  of  C.  The  principal  l:th  root  of 
the  matrix  C  is  then  given  by 


d=[u;]0  (23) 


where,  given  a  vector  u  =  {(^1,(^2, ^3)  G 
matrix  [w]  is  defined  by 


H  = 


0  W3  — W2 

—0)3  0  Wi 

0)2  “U)i  0 


(24) 


In  the  sequel  we  apply  the  results  of  the  previous 
section  in  order  to  parameterize  the  rotation  group. 
In  particular,  the  series  of  conformal  mappings  from 
Eq.  (14)  provide  a  family  of  coordinates  on  50(3). 
Before  undertaking  this  task  we  investigate  another 
important  conformal  mapping. 


W  =  UQkU*  (19) 

where  —  C  and 

0jt  =  Wocfcdia</(©J,02fM0n-ii+l)  (^9) 

if  n  is  odd  and 

0  j.  =  blockdiag{Qi ,  ©2 » •  •  •  >  ©n)  (21) 

if  n  is  even,  and 

‘’nl-  )■  =  * . " 

I  0  e-'-^  . 

Since  #  -1  for  all  j  =  1 . n  (n  -  1)  the 

angles  9j  #  ±180  deg  and  thus  also  ^  ±180  deg 

for  jfc  =  2,3,...  and  thus  c’^  #  -1.  Notice  that 
in  order  to  keep  W  proper  we  always  choose  the 
positive  root  of  the  eigenvalue  ±1. 

5.  Attitude  Representations 


5,1.  The  Exponential  Map  and  the  Euler  Pa- 
rameters 

Linear  fractional  transformations  are  not  the  only 
class  of  conformal  mappings  from  the  imaginary  axis 
onto  the  unit  circle.  The  exponential  map,  defined 

by 

w  =  ezp(z)  =  c*  (25) 

also  maps  9  onto  5^.  Clearly,  if  2  =  then  \z\  =  1. 
The  inverse  transformation  is 

z  =  logw  =  i{e  +  2m:),  n  =  0,±1,±2,... 

and  is  defined  only  locally. 

We  can  therefore  define  the  exponential  map  from 
the  space  of  skew-symmetric  matrices  to  the  space  of 
proper  orthogonal  matrices.  This  exponential  map 
is  defined,  as  usual,  by 

C  =  c«  =  f;i(3'’  (26) 

n=:0 


In  this  section  we  concentrate  on  the  ramifications 
of  the  previously  developed  results  to  attitude  rep¬ 
resentations.  Our  motivation  for  investigating  Cay¬ 
ley  transforms  in  the  first  place,  stems  from  the  fact 
that  proper  orthogonal  matrices  represent  rotations. 
In  particular,  50(3)  is  the  configuration  space  of  all 
three-dimensional  rotations.  In  other  words,  every 
element  of  50(3)  represents  a  physical  rotation  be¬ 
tween  two  reference  frames  in  IR^  and  conversely, 
every  rotation  can  be  represented  by  an  element  in 
50(3). 

As  a  reference  frame,  viz.  a  body,  rotates  freely 
in  the  three-dimensional  space,  the  corresponding 


and  the  series  converges  for  every  Q.  For  the  three- 
dimensional  case,  the  matrix  Q  €  5o(3)  can  be  pa¬ 
rameterized  by 


Q  = 


0  03  —02 

—03  9  0i- 

02  -01  0  , 


(27) 


As  before,  given  a  vector  0  =  (/?i, ^21/^3)  €  IR  we 
will  also  use  the  notation  \0]  to  denote  the  skew- 
symmetric  matrix  in  Eq.  (27).  Noticing  that 


[0]'^  =  00'^  -  Mil'll 
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one  obtains  that 

fc  =  0.1,2.... 

and 

Substituting  the  previous  expressions  in  Eq.  (26)  we 
get  Euler’s  formula* 

\B\  X  00^ 

C{0)  =  =  cos^/  +  sin^  ^  +  (1  —  cos 4>) 

where  4>  =  11)311.  Equivalently, 

e'^^  =  I  +  s{n<l>^  +  {l-cos<t,)^  (28) 

Normalizing  the  vector  /?  we  get  a  unit  vector 
e  =  -^ 

m\ 


0  =  <l>i  (29) 

Euler’s  theorem^  states  that  any  rotation  can  be 
represented  by  a  finite  rotation  (principal  rotation) 
about  a  single  axis  (principal  axis).  That  is,  the 
principal  axis  and  the  principal  angle  suffice  to  de¬ 
termine  the  rotation  matrix.  From  a  mathematical 
perspective  this  amounts  to  parameterizing  every  el¬ 
ement  in  50(3)  by  the  principal  a)ds  and  the  prin¬ 
cipal  angle. 

By  letting  the  principal  axis  be  along  the  direc¬ 
tion  of  the  unit  vector  e  and  by  letting  the  principal 
angle  be  ^  as  above,  Eq.  (28)  shows  how  this  pa¬ 
rameterization  is  achieved.  Clearly, 

C(0,e)  =  e^t*J  (30) 


Moreover,  introducing  the  Euler  parameter  vector 
q  =  (90,91.92.93) 

50  =  cos  qi  =  ii  sin  i  =  1, 2, 3  (31) 


and  substituting  in  Eq.  (28)  one  obtains  the  well- 
known  formula  for  the  rotation  matrix  in  terms  of 
the  Euler  parameters 


0(9)  = 


9o  +  9i  “  92  ~  93  2  (qi?2  -f  9093) 

2  (9192  -  9093)  9o  -  9?  +  92  -  93 
2  (9193  +  9092)  2  (9293  -  9o9i) 


2  (9193  -  9092) 

2  (9293  +  9091) 

9o  -  9i  -  92  +  93  . 


(32) 


Therefore,  the  Euler  parameter  representation 
is  obtained  by  generalizing  the  conformal  mapping 
in  Eq.  (25)  to  the  space  of  matrices.  Notice  from 
Eq.  (32)  that  C(g)  =  0(-9)  and  both  q  and  -q 
can  be  used  to  describe  the  same  physical  orienta¬ 
tion.  This  fact  can  be  used  to  construct  alternative, 
or  “shadow” ,  sets  of  kinematic  parameters  obtained 
via  the  Cayley  transforms. 


5.2.  Rodrigues  Parameters 

Since  the  Euler  parameters  satisfy  the  additional 
constraint  9?  -h  9i  +  92  +  93  =  1.  o^e  is  naturally 
led  to  consider  the  elimination  of  this  constraint, 
thus  reducing  the  number  of  coordinates  from  four 
to  three.  The  Rodrigues  parameters  achieve  this  by 
defining 

=  i  =  1.2,3  (33) 

90 

The  three  parameters  puP^tPz  then  provide  a  three- 
dimensional  parameterization  of  50(3).  The  inverse 
transformation  of  Eq.  (33)  is  given  by 


90  = 


(1  +  P-) 


;2^1> 


= 


Pi 


(1+p*) 


i  =  1,2.3 


(34) 


where  p^  =■  p\-¥ p\-\- p\-  The  Rodrigues  parameters 
are  related  to  the  principal  axis  and  angle  through 
the  equation 

4>  . 

p  =  tan  —  e 


The  rotation  matrix  in  terms  of  the  Rodrigues  pa¬ 
rameters  can  be  easily  computed  using  Eq.  (32)  and 
Eq.  (34). 


C{p)  = 


1 

l  +  p2 


1  -  +  2/)J  2  (P1P2  +  pz) 

2  {plp2  —  Pz)  1  —  +  2p2 

2(P3P1+P2)  2(P2P3~Pi) 


2(P3Pi  -P2) 
2(P2P3  +Pl) 
l-p^  +  2pl 


(35) 


It  is  remarkable  the  fact  that  the  previous  parame¬ 
terization  of  50(3)  can  also  be  achieved  by  means 
of  the  Cayley  transformation  in  Eq.  (2).  Indeed,  if 
we  introduce  the  skew-symmetric  matrix 


R=-[p]  = 


-pz 

0 

Pi 


P2 

-Pi 

0 


the  transformation 


C  =  {I-R){I  +  R)-^  =  {I  +  R)-\I-R)  (36) 
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produces  exactly  the  matrix  in  Eq.  (35).  There¬ 
fore  the  classical  Cayley-Rodrigues  parameters  rep¬ 
resentation  is  obtained  by  generalizing  the  confor¬ 
mal  mapping  in  Eq.  (8)  to  the  space  of  matrices. 


5.3.  Modified  Rodrigues  Parameters 


The  normalization  in  Eq.  (33)  is  not  the  only  pos¬ 
sible  one.  A  more  judicious  normalization  for  elim¬ 
inating  the  Euler  parameter  constraint  is  through 
stereographic  projection^^’^^’*'^.  Using  this  approach, 
the  new  variables 


gj 

1  +  go’ 


i=  1.2,3 


(37) 


provide  coordinates  on  SO(3).  These  parameters 
are  referred  to  in  the  literature  as  the  Modified  Ro¬ 
drigues  psaameters®  and  have  distinct  advantages 
over  the  classical  Rodrigues  parameters.  In  partic¬ 
ular,  while  the  Rodrigues  parameters  do  not  allow 
eigenaxis  rotations  of  more  than  180  deg,  the  Mod¬ 
ified  Rodrigues  parameters  allow  for  eigenaxis  rota¬ 
tions  of  upto  360deg®’^*^^’“'^®.  This  can  be  imme¬ 
diately  deduced  by  the  corresponding  relationship 
between  <r  and  the  principal  axis  and  angle 


<l>  . 

a  =  tan  -r  e 
4 

which  is  well-behaved  for  0  <  ^  <  2x.  Since  both  q 
and  — g  describe  the  same  physical  orientation  (re¬ 
call  the  discussion  at  the  end  of  section  5.1),  a  second 
set  of  parameters  defined  by 


gj 

1-go’ 


i  =  1.2,3 


referred  to  as  the  “shadow”  set^®,  can  be  used  to 
describe  the  same  physical  orientation.  These  pa¬ 
rameters  are  also  given  by 


The  transformation  between  tr  and  is  given  by^® 


(38) 


where  g  —  g\  -i-  ^3  —  tan* 

The  rotation  matrix  associated  with  the  Modi¬ 
fied  Rodrigues  Parameters  is  given  by 


C((7)= 


1  +  a^ 


4Ei  +  S*  -h 

8(Ti(T2  —  4<r3E  4^2  -h  E*  _ 

dcTiaa  -f-  4<T2E  8<T2<T3  —  4<7’iE 


8o’i<r3  —  4(7’2E 
8cr2<r3  +  4(TiI) 
4E3  -f  E* 


(39) 


where  E  =  1  -  d*  and  Ej  =  — d*  4-  2crj ,  j  =  1, 2, 3. 

In  Ref.  [7]  it  was  shown  that  these  parameters 
are  defined  by  a  Cayley  transformation  of  second 
order.  That  is,  if 


s  =  -M  = 


0  -G3  ff2 

ff3  0  — <Tl 

—G2  O’!  0 


then  the  transformation 


C  =  (I  -  5)*(/  -1-  S)-2  =  (J  -h  5)-*(/  -  S)*  (41) 

produces  exactly  the  matrix  in  Eq.  (39).  Notice  that 
the  inverse  of  the  transformation  (41)  is  not  unique 
and  it  requires  the  square  root  of  an  orthogonal  ma¬ 
trix.  Given  C  €  50(3)  we  find  a  matrix  W  such 
that 

C  =  W^  (42) 

Once  a  matrix  W  is  calculated,  the  skew-symmetric 
matrix  S  containing  the  Modified  Rodrigues  param¬ 
eters  is  computed  from 

S  =  (J  -  VU)(/  +  W)-'-  =  {1  +  -  W)  (43) 


Reference  [7]  outlines  this  approach.  To  every  or¬ 
thogonal  matrix  corresponds  a  principal  angle  and 
a  principal  direction  according  to  Eq.  (30).  From 
Eqs.  (30)  and  (42)  one  therefore  has  that 


W  =  e  (44) 


and  W  has  half  the  principal  angle  of  0.  It  should 
be  apparent  now  how  the  Modified  Rodrigues  pa¬ 
rameters  double  the  domain  of  validity  of  the  pa¬ 
rameterization  by  taking  the  square  of  the  classical 
Cayley  transform. 

This  observation  motivates  the  search  of  higher 
dimensional  Cayley  transforms  for  attitude  repre¬ 
sentations.  Such  transformations  are  expected  to 
increase  the  domain  of  validity  even  further.  This  is 
the  topic  of  the  next  section. 


5.4.  Higher  Order  Rodrigues  Parameters 

According  to  the  discussion  in  the  previous  section 
one  expects  that  higher  order  Cayley  transforma¬ 
tions  will  increase  the  domain  of  validity  of  the  cor¬ 
responding  parameters.  The  main  task  of  this  sec¬ 
tion  is  to  derive  these  higher  order  parameters  and 
find  their  connections  to  the  Rodrigues  parameters, 
the  Modified  parameters  and  the  Euler  parameters. 
To  this  end,  consider  first  the  fourth  order  Cayley 
transform  defined  by 

C=il-T)\l  +  T)-^  (45) 
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for  some  skew-symmetric  matrix 

T=-[r]  = 


T2 


0  -T3 

rs  0  -n 

-T2  n  0 


(46) 


We  know  that  the  matrix  C  is  (proper)  orthogonal. 

Recall  from  the  results  of  section  3  that  if  F  is 
analytic  function,  then  the  eigenvalues  of  the  matrix 
^(^4)  are  given  by  F(Aj)  where  Xj  are  the  eigenval¬ 
ues  of  A.  It  is  an  easy  exercise  to  show  that  the 
eigenvalues  of  the  skew-symmetric  matrix  in  Eq.  (46) 
are  given  by 

0,  ±i(i'i +r|  +  r|)>  (47) 

Similarly,  the  eigenvalues  of  the  matrix  S  in  Eq.  (40) 
are  given  by 

0,  -h  cr3)i  (48) 

Let  Ar  denote  an  eigenvalue  of  T  and  A^  an  eigen¬ 
value  of  5.  Comparing  Eqs.  (41)  and  (45)  one  sees 
that  the  matrices  S  and  T  are  related  by 


and  computing  C  from  Eq.  (41)  verifies  the  expres¬ 
sion  in  Eq.  (51). 

The  relation  between  r  and  q  is  obtained  by  ob¬ 
serving  that 

=  i  =  1.2.3  (52) 


2t,- 


1  -  rf  -  r|  -  r|  1  +  ?o  ’ 

Using  the  shorthand  notation  =  rf  +  t|  -f  r|  the 
previous  expression  can  be  written  as 


2rj-  _  qj 

l-f2  1  +  qo' 


i  =  1.2,3 


Therefore, 


_  g?  +  gj  +  ql 


or 


^(l-f2)2“  (1  +  go)* 


(I  -  5)(7  +  S)-^  =  (7  -  Tf{I  +  T)-^ 

This  suggests  that  A^  and  Ar  3-re  related  by 

l-A,  ^  A-AA’ 

1  +  A„  \1-1-Ar/ 

“  1  +  ,  _(i±^ 

1  +  A^- 

Solving  for  A^  one  obtains  that 

2Ar 


g?  4-  gj  +  g3  +  (1  +  go)^ 
(1  +  go)^ 

2(l  +  go)_  2 
(1  +  go)^  1  +  go 


(53) 


(49) 


(50) 


or  that 


and  thus, 


l±i^  =  ± 

l-f2 


■\/l  +  go 
2  _  dbV^  +  v^l  d"  go 


1  -  y/1  +  go 

Using  now  Eq.  (52)  one  finally  obtains  that 


^  1  +  5o  ±  \/2(l  +  go) 

Conversely,  from  Eq.  (53)  one  obtains  that 

and  using  Eq.  (52)  that 


Aa  = 


1  +  Aj 


Substituting  the  expressions  for  A^  and  Ar  from 
Eqs.  (47)  and  (48)  in  the  previous  equation  one  ob¬ 
tains  that 

d-T-l)^ 


j  =  1.2,3  (54) 


i2\2 


(55) 


4ri(l  -  f*)  .  1  o  q 

.  -ovT >  J— 1)2,0 


9i-  + 

From  Eq.  (55)  we  also  have  that 


,  .  „±i(T?  +  T2 +T3) 

±l(or?  +  0-2  +  0-|)’  -  2 

Upon  squaring  this  expression  one  obtains 

2^  2  .  2_.__!i±!2±l3_ 

<ri  -f  0-2  +  <^3  _  4^^  _  ^2  _  ^2  _  .^2)2 

This  equation  suggests  that  <r  and  t  are  related  by  go  =  2  K  1  -  1  = 

± j  =  1.2,3  (51) 

1  —  Tj  —  r2  —  r3 

Arbitrarily,  and  without  loss  of  generality,  we  choose 
the  solution  with  the  plus  sign.  Substitution  in  S 


(1  -  6f^  +  f'*) 
(H-f2)2 


where  Letting  W  =  {I  -  T)(7  +  T)-^ 

and  since  C  =  W*  one  obtains  that 

W  =  e 


8 


•where  4>  is  the  principal  angle  of  C.  Moreover,  using 
the  definition  of  the  Euler  parameters  from  Eq.  (31) 
one  obtains  the  following  result  for  the  t  parameters 


—  c  (56) 
1  +  cos  f  ±  >/2(l  +  cos  f ) 


where  c  is  the  unit  vector  along  the  principal  ^is. 
Using  the  trigonometric  identity  cos  |  =  2  cos  ^  - 1 , 
the  previous  equation  reduces  to 


r 


sin 


± 


1  +  cos  I  ±  2  cos  I 


(57) 


Keeping  the  plus  sign,  Eq.  (57)  can  be  further  re¬ 
duced  to  the  simple  formula 


T+  =  tan  I  c,  (-4ir  <  ^  <  47r)  (58) 

o 

From  Eq.  (58)  it  is  apparent  that  r  is  proportional 
to  the  principal  rotation  axis,  like  the  classical  and 
the  Modified  Rodrigues  parameters,  where  now  the 
proportionality  factor  is  f{4>)  =  tanf.  A  plot  of 
f{4>)  is  shown  in  Fig.  2. 


have  a  unique  set  of  **shadow”  parameters  like  the 
Modified  Rodrigues  parameters'®.  These  parame¬ 
ters  are  obtained  by  setting 


_ 


—  Sin  • 


l-cosf  ±2sinf 


(60) 


In  can  be  easily  verified  that  the  corresponding  “shadow” 
parameters  reduce  to 


tan  t  —  1  . 

- ! - c 

tanf-t-l 


(-27r  <  ^  <  6ir) 


(61) 


and 

J.S  _  1  +  I  (_6x  <(j)<2ir)  (62) 

1  —  tan  I 

As  the  original  r  parameters  approach  -1-1,  the  asso¬ 
ciated  “shadow”  parameters  r*  approach  zero  and 
vice  versa.  The  general  transformation  between  the 
original  and  the  “shadow”  set  is  given  by 


\2f2-}-(i-hf2)fy 


(63) 


Figure  2:  Plot  of  /{<!>)■ 

Equation  (58)  is  reassuring,  since  it  proves  that 
the  T  parameters  indeed  behave  as  “higher  order 
Rodrigues  parameters  which  can  be  used  to  “lin¬ 
earize”  the  domain  of  validity  of  the  kinematic  pa¬ 
rameterization.  By  this,  we  mean  that  Eq.  (58)  be¬ 
haves  almost  linearly  as  a  function  of  the  principal 
angle  (ji  (especially  in  the  region  —it/S  <  ^  <  ’’’/S)! 
see  also  Fig.  3. 

If  we  choose  the  minus  sign  in  Eq.  (56)  we  obtain 
that  . 

r_  = - T-  e,  (0  <  ^  <  87r)  (59) 

tanf 

Moreover,  reversing  the  signs  of  the  Euler  parame¬ 
ters  in  Eq.  (54),  one  obtains  that  the  r  parameters 


where  f  =  (f-)a .  Equations  (58),(59),(61)  and  (62) 
can  be  used  in  order  to  compute  the  four  distinct 
roots  of  Eq.  (45).  Note  also  that  Eqs.  (58),(61),(59) 
and  (62)  can  be  also  written  in  the  form 

r  =  tan(|-lk^)e,  i  =  0,l,2,3 
respectively. 

The  “shadow”  parameter  set  t’  is  shown  side-by- 
side  with  the  original  r  parameters  in  Fig.  3.  The 
shadow  set  is  plotted  in  grey  color.  Figure  3  also 
shows  that  t  parameters  are  indeed  very  linear  for 
small  rotations  within  ±180  deg. 

As  with  the  Modified  Rodrigues  parameters  (and 
other  stereographic  parameters'®),  these  “shadow” 
parameters  represent  the  same  physical  orientation 
as  the  original  set  and  abide  by  the  same  differen¬ 
tial  kinematic  equation.  They  could  be  used  to  avoid 
the  problems  of  approaching  the  ±720  deg  principal 
rotation.  By  switching  to  the  shadow  trajectory, 
all  numerical  problems  would  be  avoided.  Having, 
however,  a  principal  rotation  range  of  ±720  deg  is 
really  mote  than  needed.  Limiting  the  principal  ro¬ 
tations  to  be  within  ±180  deg  would  suffice  and  be 
much  more  attractive.  As  the  magnitude  of  t  ap>- 
proaches  tan  |  then  one  would  simply  switch  the  r 
to  their  “shadow”  set.  Having  |t|  =  tan  |  corre¬ 
sponds  to  5o  —  0-  From  Eq.  (54)  one  can  then  see 
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Figure  3:  Comparison  of  original  and  “shadow  r 
parameters. 


that  at  this  point,  the  two  sets  of  parameters  are 
related  by  r  =  -r*.  The  combined  set  of  original 
and  “shadow”  r  parameters  would  provide  a  set  of 
attitude  coordinates  which  are  “very  linear”  with  re¬ 
spect  to  the  principal  rotation  angle,  more  so  even 
than  the  Modified  Rodrigues  parameters.  We  note 
ia  passing  that  the  previous  approach  can  be  easily 
extended  to  any  Cayley  transform  of  order  2  ,  since 
Eqs.  (49)  and  (50)  can  be  used  iteratively. 

For  the  third  order  Cayley  transform  we  have 


that 

c  =  (I  -  P)\I  +  P)-^  =  a  + 

where  P  =  -[p]  and  p  =  (pi,P2,P3)  the  correspond¬ 
ing  parameters.  If  Ap  and  Ap  denote  the  respective 
eigenvalues  of  the  skew-sj-mmetric  matrices  R  and  P 
then,  using  Eqs.  (36)  and  (64),  they  must  be  related 
by 


1-A, 


= 

Vl  +  Apy/ 


1  + 

r,  upon  expanding  the  previous  equality 


1  -  ^  _  1  -  +  3A^  -  3Ap 


thus 


1  +  Ap  1  +  A^  +  ZXp  +  3Ap 
Id-Ap- 


Solving  for  Ap  we  obtain 


,  Ap(3  +  A|) 

~  1  +  3AH 


In  order  to  get  the  relation  of  p  to  the  Euler 
parameter  vector  one  can  set 

Piii  -Pi-pI-  pI)  _  ii  (65) 

l-Z{pl-\rpl  +  pl)  90 

and  solve  for  f  =  pl+pl+pl  After  some  algebraic 
calculations,  it  is  not  difficult  to  show  that,  in  fact, 


(l-3p2)2  ,2 


(66) 


Solution  of  the  previous  equation  for  p*  requires  the 
solution  of  a  cubic  equation.  Once  p^  is  known  how¬ 
ever,  it  can  be  substituted  into  Eq.  (65)  to  get  the 
desired  result.  Actually,  from  Eqs.  (65)  and  (66)  we 
have  that 


90  = 


l-3p^ 

(1  +  P^)^’ 


9;  = 


Pi(3-P^) 
(1  +  P=)^’ 


i  =  1,2.3 


Letting  W  =  {I  -  P){I  +  ^’)"^  since  C  =  W^ 
one  obtains  that 


w  = 


where  <j>  is  the  principal  angle  of  C. 


6.  Kinematics 


The  kinematic  equations  in  terms  of  the  t  param¬ 
eters  can  be  computed  as  follows.  From  Eqs.  (23) 
and  (45)  we  have  that 

c  =  ^[(I-r)"](f  +  T)-'‘-h(/-T)‘‘^[(I  +  T)-'‘l 

=  S(w)(I-T)^(/  +  T)-‘‘ 


or  that 

^[iI-T)*]-C{T)^[{I+T)*]  =  S(u)(/-T)^  (67) 
dt 

where  we  have  used  the  fact  that 


for  any  square  matrix  A.  Using  also  the  fact  that 


The  previous  equation  suggests  that  pj  and  pj  are 
related  by 


■  pdZ-Pi-pl-pl) 

^^■"^1-3(P?+Pl  +  Pi)’ 


i  =  1,2,3 


and  performing  the  differentiations  in  the  left-hand- 
side  of  Eq.  (67),  one  obtains  a  set  of  nine  linear  equa¬ 
tions  in  terms  of  ti,T2-  and  T3.  Similarly,  the  right- 
hand-side  of  Eq.  (67)  is  linear  in  terms  of  wi,W2,W3- 


Choosing  three  (independent)  equations  out  of  these 
nine,  we  get  a  linear  system  of  the  form 

V{t)t  =  U(t)u> 


Solving  for  t  we  finally  get  that  the  kinematic  equa¬ 
tions  for  the  T  orientation  parameters  are  given  by 


$1  =  V-HrW(T)u,  =  G(r)« 
at 


where  the  matrix  G(t)  is  given  by 

1 


G{r)  = 


l-f2 


Ti  +  r|r|  -  3(r|  +  r|) 
2r3(l  -  T*)  +  nT2(3  -  f2) 
[  -2t2(1  -  T^)  +  nT-3(3  -  f^) 


-2t3(1  -  f2)  +  Tir2(3  -  f*) 
T2  +  tItI  -  3(r|  +  rf ) 
2ri(l  -  f^)  +  T-iTsiZ  -  f2) 

2r(l  —  f^)  -h  nrsiZ  -  f^) 
-2ri(l  -  f2)  +  r2T3[Z  -  f2) 
T3  +  rf  r|  -  3(ti2  -1-  r|) 


(68) 


and Tj  =  5(1  +  Ti  +T2  +t^  —  2Tj^),  j  —  1. 2, 3.  This 
equation  can  be  written  more  compactly  in  a  vector 
form  as  follows 
dr  _ 

di  ~  8(1  -  f2) 

-  4(l-f2)[r]-t-(l-6f*4-07]a;  (69) 


-i^[2(3-fVr^ 


These  kinematic  equations  are  not  as  simple  as 
the  corresponding  kinematic  equations  for  the  Ro¬ 
drigues  or  the  Modified  Rodrigues  parameters^'^^. 
Moreover,  there  is  an  apparent  singularity  at  f  = 
±1,  equivalently  at  ^  =  ±2ir.  The  limiting  behavior 
of  these  equations  as  f  — ♦  ±1  will  be  determined 
through  further  analytical  and  numerical  studies. 
At  any  rate,  because  of  the  near-linear  behavior  be¬ 
tween  4>  and  the  magnitude  of  t  as  seen  in  Fig.  2, 
for  small  principal  angles,  Eq.  (69)  is  expected  to  be¬ 
have  in  a  more  “linear-like”  fashion  than  either  the 
Cayley-Rodrigues  or  the  Modified  Rodrigues  param¬ 
eters. 

Similarly,  for  the  third  order  Cayley  parameters, 
one  can  derive  the  following  kinematic  equations 


dt 


[(11  -  p^)pp^ 


6(3 -p2) 

3(3-p^)b]  +  3(l-3p")7]w 


(70) 


These  equations  can  be  derived  starting  from  Eqs. 
(23)  and  (64)  and  using  similar  arguments  as  before. 
Singularities  for  the  p  parameters  are  encountered  at 
p  =  ±v^.  As  before,  further  analysis  is  required  to 
determine  the  limiting  behavior  of  this  system  as 
p—*±y/Z. 


7.  Numerical  Example 

In  order  to  demonstrate  the  potential  benefits  or 
drawbacks  of  the  previous  kinematic  parameters  the 
following  simulation  was  performed.  We  integrated 
Eqs.  (69)  as  well  as  the  corresponding  kinematic 
equations  in  terms  of  the  Cayley-Rodrigues  (p)  and 
the  Modified  Rodrigues  parameters  (cr)  starting  from 
the  zero  orientation  and  subject  to  the  constant  an¬ 
gular  velocity  vector  w  =  (0.25, 0.4,  —0.1)  {rad/ sec). 
This  corresponds  to  a  linearly  increasing  value  of  the 
principal  angle  <t>.  The  results  of  the  simulations  are 
shown  in  Fig.  4.  This  figure  actually  shows  only  the 
first  components  of  the  kinematic  parameter  vec¬ 
tors,  as  the  other  two  components  exhibit  similar 
behavior. 


COMPABISONOFOniENTATION  PARAMETERS 


Figure  4:  Orientation  parameter  comparison. 

As  it  is  evident  from  this  figure,  the  classical  and 
the  Modified  Rodrigues  parameters  encounter  the 
singularity  earlier  that  the  t  parameters.  We  note, 
however,  that  since  discontinuities  in  the  parameter 
description  are  typically  acceptable  in  applications, 
the  Modified  Rodrigues  parameters  can  be  made  to 
avoid  the  singularity  altogether  by  simply  switch¬ 
ing  to  their  “shadow”  set^®.  The  same  also  holds 
for  the  T  parameters  via  Eq.  (63).  Figure  5  shows 
the  simulation  where  the  parameters  <r  and  t  are 
allowed  to  switch  to  their  respective  “shadow”  sets. 
Although  the  points  of  switching  are  arbitrary  and 
can  be  chosen  according  to  the  particular  applica¬ 
tion,  a  reaisonable  choice  is  to  switch  when  the  pa¬ 
rameters  and  the  corresponding  “shadow”  set  have 
opposite  signs.  This  will  ensures  continuity  of  the 
magnitude.  From  Eqs.  (38)  and  (63)  this  occurs 
when  «f>  —  k  X,  k  =  ±1,  :k2, . . ..  This  is  the  situation 
depicted  in  Fig.  5.  The  t  parameters  are  shown  in 
solid  line,  and  the  a  parameters  are  shown  in  dashed 
line. 
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sets. 


Since  the  classical  Rodrigues  parameters  do  not 
have  an  associated  “shadow”  set  (better,  the  shadow 
set  coincides  with  the  original  parameters),  only  the 
the  o  and  t  parameters  are  plotted  in  Fig.  5. 

8.  Conclusions 

We  have  extended  the  classical  Cayley  transform 
which  maps  skew-symmetric  matrices  to  proper  or¬ 
thogonal  matrices  to  higher  orders.  The  approach 
is  based  on  the  observation  that  Cayley  transforms 
can  be  viewed  as  generalized  conformal  (bilinear) 
mappings  in  the  space  of  matrices.  The  Euler  pa¬ 
rameters,  the  Rodrigues  parameters  and  the  Modi¬ 
fied  Rodrigues  parameters  follow  as  special  cases  of 
this  approach.  In  addition,  we  generate  a  family  of 
higher  order  “Rodrigues  parameters”  which  could  be 
used  as  coordinates  for  the  rotation  group.  It  still 
remains,  however,  to  determine  the  applicability  of 
these  higher  order  parameters  in  realistic  attitude 
problems. 
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AN  EIGENFACTOR  SQUARE  ROOT  ALGORITHM 
FORMULATION  FOR  NONLINEAR  DYNAMICS 

John  L.  Junkins*  and  Hanspeter  Schaub^ 


A  novel  method  is  presented  to  solve  the  equations  of  rnotion  for  a  large 
class  of  constrained  and  unconstrained  dynamical  systems.  Given  an  analytic 
expression  for  the  system  mass  matrix,  quasi-coordinate 
are  derived  in  a  manner  that  generates  equations  analogous  to  the  dynam¬ 
ics/kinematics  partitioning  in  Eulerian  rigid  JJ'®  ufcS 

is  accomplished  by  introducing  a  new  quasi  velocity  coordinate  t]  which  y^lds 
a  dynami«l  system  with  an  identity  mass  matrix.  The  problem  of  inverting 
a  complex  mass  matrix  is  replaced  by  the  problem  of  solving  two  first  order 
differential  equations  for  the  mass  matrix  eigenfactors.  A  new  method  s  i n- 
troduced  whereby  dynamical  constraint  equations  are  solved  using  a  related 
eigenfactor  formulation,  forgoing  any  need  to  solve  the  algebraic  constraint 
equations  simultaneously  with  the  differential  equations  of  motion. 


INTRODUCTION 

The  equations  of  motion  of  complex  dynamical  systems  are  usuaUy  second  order  noi^ear  dif¬ 
ferential  equations  which  require  taking  the  inverse  of  a  time-varying,  configuration  variable  mass 
matrix.  Such  dynamical  systems  could  be  a  large  nonlinear  deformation  model  for  an  ^bitr^ 
body,  a  multi-body  system  or  a  multi-link  robot  arm.  One  reason  why  the  resultmg  d^armcs 
are  compHcated  is  that  they  are  usually  written  in  a  way  that  combines  coordinates  natmal  to 
the  momentum  or  energy  description  with  those  natural  to  the  displacement  d^cnption.  The  r^ 
suit  is  a  split  between  momentum  differential  equations  and  kinematic  differential  equations.  This 
natural  spUtting  is  typically  destroyed  when  the  generalized  methods  of  mechamcs  are  employed 
and  result  in  a  more  compUcated  mass  matrix.  This  occurs  when  the  cl^sical  Lagrange  equations 
of  motion  are  written  in  terms  of  a  generalized  coordinate  and  their  time  derivatives.  By  usmg 
Newton-Eulerian  medianics  or  the  Boltzmann-Hamel  version  of  Lagrmge’s  equations,  it  is  possi¬ 
ble  to  introduce  quasi-coordinates  whidi  separate  the  decision  of  choosing  displacement  coordmat^ 
and  velocity  (momentum)  coordinates.  As  is  well-known,  (e.g.  Eulerian  rigid  body  dynamics),  this 
process  often  leads  to  much  more  attractive  equations  than  those  that  result  from  brute  force 
application  of  Lagrange’s  equations.  It  is  possible  to  bring  the  equations  of  motion  to  then  most 
convenient  form  with  a  constant  mass  matrix-^-^  For  general  configuration-vanable  mass  matnces, 
there  has  not  been  a  generally  applicable  method  to  accomplish  an  analogous  transformation. 

Several  methods  have  been  proposed  to  carry  out  the  mass  matrix  inverse^-®  raging  ^0™  taking 
an  algebraic  inverse,  to  using  traditional  numerical  inverse  methods  (such  as  a  Cholesky  decomp^ 
sition)  to  the  elegant  method  of  using  the  innovations  factorization.®  Naturally  each  method  has  its 
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advantages  and  disadvantages.  The  algebraic  inverse  in  only  feasible  for  relatively  small  systems, 
even  with  symbol  manipulation  programs  such  as  Mathematica  and  Maple.  Taking  a  numen^ 
inverse  at  each  integration  step  is  computationally  costly  and  difficult.  The  method  proposed  by 
Ref  2  uses  the  innovations  factorizations  technique  to  parameterize  the  mass  matrix  and  recur¬ 
sively  approximate  its  inverse.  The  mass  matrix  factors  involved  are  obtained  from  a  recursive 
filter.  However,  this  recursive  filter  is  conveniently  applicable  only  to  a  linked  body  cham  and  other 
kinematically  recursive  topologies. 

This  paper  presents  a  method  to  solve  a  very  general  class  of  construed  and  imconstr^ed  dy¬ 
namical  systems  and  avoids  the  necessity  of  inverting  a  configuration  variable  mass  matrix  to  obtain 
instantaneous  accelerations.  The  equations  of  motion  will  be  separated  into  dynamical  ^d  kine¬ 
matic  differential  equations  somewhat  analogous  to  classical  developments  m  npd  body  dynamics. 
The  mass  matrix  will  be  initially  parameterized  by  a  numerical  eigenfactor  decomposition.  After  ra- 
taMisbing  this  initial  condition,  only  the  eigenvectors  and  the  eigenvalues  of  the  mass  matrix  be 
forward  integrated  from  differential  equations  derived  herein.  The  resulting  method  will  require  no 
matrix  inverse  to  be  taken.  The  eigenfactor  differential  equations  are  solved  by  extending  an  elegant 
square  root  algorithm  proposed  by  Oshman  and  Bar-Itzhack<  to  solve  the  matrix  Riccati  equation. 
The  formulation  also  allows  any  Pfaffian  constraints  to  easily  be  incorporated  mto  the  equations 
of  motion,  thus  avoiding  having  coupled  algebraic  constraint  equations  to  be  solved  simult^eously 
with  the  ori^al  equations  of  motion.  The  implications  of  these  developments  for  both  efficiency 
and  accuracy  are  enormous. 

PROBLEM  FORMULATION 

The  equations  of  motion  for  a  dynamical  system  are  usually  derived  by  first  formulating  the 
kinematic  energy  T  and  the  potential  energy  V.  Let  the  system  Lagrangian  £  be  defined  as 

C  =  T-V  (1) 

Let  X  be  the  system  state  vector,  then  the  potential  energy  is  given  by 

V  =  V{x)  (2) 

The  kinetic  energy  can  be  written  in  terms  of  the  generalized  configuration  coordinate  vector  deriva¬ 
tive  X  or  in  terms  of  a  quasi-velocity  vector  y  defined  as 

y  =  P{x)x  (3) 

A  field  where  quasi-velocities  are  often  preferred  over  configuration  coordinate  derivatives  is  in  rigid 
body  dynamics.  For  example,  it  is  much  simpler  to  write  the  system  kinetic  energy  in  terms  of  the 
body  angular  velocity  u  then  in  terms  of  the  Euler  attitude  angle  derivatives  6.  Let  M  (x,  f)  be  the 
mass  matrix  for  a  system  described  with  y,  then  the  kinetic  energy  is  given  by 

T  =  T2  +  Ti  -I-  To  =  t)y  +  G^(x, t)y  +  To(x,  t)  (4) 

where  the  Ti  and  To  terms  only  appear  in  unnatural  systems.  However,  to  find  the  traditional  version 
of  Lagrange’s  equations  of  motion  the  kinetic  energy  needs  to  be  wntten  in  terms  of  generalized 
coordinate  derivatives,  not  quasi- velocities.  Using  Eq.  (3),  the  kinetic  energy  can  be  rewritten  in 
terms  of  db. 

T2  =  |x^P(x)^itJ(x,t)P(x)x  =  |x^Af(x,0i  (5) 

Ti  =  G^(x,  f)P(x)x  =  G’’(x,  f)x  (6) 

where  Af  (x,  t)  =  P(x)^Jtf(x,  t)P{x)  is  the  system  mass  matrix  for  the  state  vector  (x,x)  ^d  G(x,  t)  = 
P'^(x)G(x,t).  For  mechanical  systems  M{x,t)  will  always  by  symmetric  positive  definite.  Let  Q  be 


oin 


a  non-conservative  forcing  term  and  let  A'^X  be  the  constraint  force,  then  the  Lagrange  equations 
of  motion  are  defined  as  j  /^r\  fir*  ^ 


ot  motion  are  aeiiiieu  «K>  a  f  fir\  fiC  ^  f7\ 

=  (7) 

dt  \dxj  dx 

with  the  Pfaffian  non-holonomic  constraint 

A{x)x  +  b{t)  =  0 

The  partial  derivatives  of  the  system  Lagrangian  £  are 

%  =  M{x,i)x-k-G{x,t) 
ax 

BC  (10) 

The  resulting  standard  Lagrange  equations  of  motion  are 

+  (m  -  (“) 

The  above  equations  of  motion  are  a  second  order  nonlinear  different^ 
generally  a  simple  task  to  solve.  In  particular,  the  time  ^d  state 

Lses  a  particular  difficulty.  These  standard  equations  of  motion,  when  coupled  to  the  constrmnt 
equations  in  Eq.  (8),  pose  a  more  significant  challenge,  especially  for  high  dmensioned  systems.  Th 
SciSyoTsUhig  systems  of  ordS  n-hm  to  obtain  (x,A)  for  each  (x,x,t)  hes  at  the  heart  of  the 

difficulty. 

THE  BOLTZMANN-HAMEL  EQUATIONS  OF  MOTION 

We  motivate  this  development  using  rigid  body  dynamics  wherein  it  is  common  practice  to  sep^e 
the  momentum  dynamics  and  kinematics.  Euler’s  equation  of  motion  are  usually  written  m  te 
oi  the  body  angulS  velocity  w,  not  in  terms  of  the  time  derivative  of  the  attitude  coordmate  vector 

9w  =  — [w]3fw  -F  ti  (13) 

e  =  /(0)w 

The  first  equation  of  Eqs.  (13)  describes  the  system  momentum  time  rate  of  chmge,  the  second 
I^cn^ el  tr^ematicilaUship  between  the  body  angular  velocity  and  the  attitude  coordmate 
derivative.  Only  using  6  and  its  inertial  derivatives  would  yield  a  much  more  complex  second  or 
differential  equation. 

This  separation  of  dynamics  and  kinematics  in  the  equations  of  motion  cannot  be  accomplished  in 
more  general  dynamiiTsystems.  However,  we  show  a  way  to  accomplish  an  analogous  structure  m 
the  system  equations,  at  the  expense  of  increasing  the  number  of  differentia  equations  * 

This  involves  projecting  the  configuration  coordinate  denvative  mto  a  / 

introducing  a  quasi-velocity  vector  which  diagonahzes  the  mass  matrix.  Smce 
always  symmetric  and  positive  definite,  it  can  be  spectrally  decompos^  using 
eigenvector  matrix  E  and  the  diagonal  positive  real  eigenvalue  matrix  D.  Instead  of  usmg  E,  let  us 

useC-E  .  m-cFdC  CCF  =  I  D  =  diag{Xi)  (1*^) 
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Let  the  diagonal  S  matrix  be  defined  as  the  positive  square  root  of  the  eigenvalue  matrix  D. 

S  =  VD  =  diag  D  —  S  (15) 

Substituting  Eqs,  (14)  and  (15)  into  Eq.  (5)  yields  the  following  kinetic  energy  expression. 

Ti  =  ii’’C^5^5Cx  (16) 

By  introducing  the  same  velocity  coordinate  vector  r?  as  in  Ref.  2 

T]  =  SCx  rj  =  t)(Xi(x),Ci(x),i)  (17) 

we  obtain  a  new  simplified  expression  for  the  kinetic  energy.  The  mass  matrix  associated  with  tj  is 
the  identity  matrix,  so 

T*  =  Tj*  +  Ti*  +  To*  =  +  G^(x,  t)Cf^S-^r]  +  To  (x,  t)  (18) 

Note  that  depends  explicitly  on  q.  However,  if  we  choose  (x,  x)  as  the  independent  set  for  taking 
partial  derivatives,  we  must  recall  that  q  depends  on  (x,  x).  The  x  dependence  is  implicit  in  Eq.  (14), 
(15),  (17)  because  S(x),  C(x)  parameterize  ikr(x)  =  SC.  Also  note  that  T"  is  equal  to  T  (both 
represent  the  same  physical  kinetic  energy  quantity),  they  differ  only  in  their  algebraic  formulations. 
The  inverse  mapping  of  Eq.  (17)  describes  the  kinematic  relationship  between  x  and  q  similar  to  the 
relationship  of  0  and  w  in  Eq.  (13),  except  for  the  orthogonality  of  C  and  the  diagonal  nature  of  5 
make  the  inverse  near-trivial. 

X  =  C'^S-^q  (19) 

The  partial  derivatives  of  the  system  Lagranpan  C  are  now  rewritten  in  terms  of  the  new  generalized 
velocity  vector  q  using  the  chain  rule.^ 

dc  ar*  arj^ar*  _  ,201 


^  ^  jT^  _  dv 

dx  dx  dr]  dx 

where  J  is  the  sensitivity  matrix  of  t]  with  respect  to  the  state  vector  x, 
since  the  C  and  S  both  indirectly  depend  on  x. 


(21) 

This  matrix  is  non-zero 


“  dxi^  ^  dxn. 


(22) 


Using  the  chain  rule  dqfdxk  is  expressed  as 


dxk  V^xjt 


+  5 


dC 

dXk 


5-^ 


The  partial  derivatives  of  T*  with  respect  to  q  and  x  are 

ffp* 

^-^q.^S-'^CG{x,t) 

dq 

ar*  _  a(^(x,t)^T^.i_  aro(x,f) 

dx  dx  dx 

With  all  these  substitutions  the  Lagrange  equations  of  motion  in  Eq.  (7)  become 


(23) 


(24) 

(25) 


I  (C^S,+  G)  -  ^  Vs-,  -  ^  +  S-.CG)  + =  0  -  a’-A  (26) 


4 


010 


After  carrying  out  the  time  derivative  and  using  the  orthogonaUty  of  the  C  matrix,  the  following 
first  order  differential  equation  is  obtained. 

i, + S-'  fcc’-s + s  -  ,  =  s-'CF  -  an  (27) 

B  =  AC’'S-‘  (28) 


and 


Q.^.6+^+J'^s-'ca 

^  dx  dx 


(29) 


The  two  first  order  equations  (19)  and  (27)  replace  the  classical  second  order  Eq.  (12).  Eq.  (27)  is 
an  interesting  new  form  of  the  well-known  the  Boltzmann-Hamel  equation  •  for  our  Aoice  of  qu^i- 
coordinates  17-  This  diagonahzed  equation  of  motion  is  very  sin^ar  to  the  one  introduced  m  Refi  2 
except  for  the  parameterization  of  the  eigenvector  matrix  and  the  form^ation  of  the  Cono^  te^ 
Note^that  no  matrix  inverse  needs  to  be  taken  thanks  to  the  orthogonality  of  the  C  matrix.  Inyertmg 
the  S  matrix  is  trivial  since  it  is  a  positive  diagonal  matrix.  At  this  stage  the  expensive  matrix  mvCTse 
problem  has  been  traded  for  another  problem  involving  finding  the  eigenfactor  denvatives  and  the 
sensitivity  matrix  J. 


MASS  MATRIX  EIGENFACTOR  DERIVATIVES 

To  solve  the  Boltzmann-Hamel  equation,  we  seek  auxiliary  differential  equations  to  yield  the 
eigenfactor  derivatives  with  respect  to  time  and  the  state  vector  i,  since  by  solving  these  we  can 
establish  the  instantaneous  C7,  5  and  J  matrices.  A  very  elegant  square  root  alpnthm  developed 
by  Oshman  and  Bar-Itzhack  to  solve  the  matrix  Riccati  differential  equation  is  extended  here  to 
solve  for  the  mass  matrix  eigenfactor  derivatives. 

This  square  root  algorithm  works  very  well,  even  with  repeated  eigenvalues  and  clusters  of  near¬ 
equal  eigenvalues.  Assume  that  the  mass  matrix  Af  has  fc  distinct  eigenvalues,  each  with  an  algebraic 
multiplicity  of  m,-,  then  let  the  eigenvalues  of  Af  be  ordered  as 


(30) 


and  equate  this  series  to  the  series  Ai, . . . ,  A„.  The  ordered  eigenvalue  matrix  D  is  then  given  by 

D  =  diag{Xi,. . .  ,A„)  (3^) 


Let  Ci  be  the  i-th  row  of  the  C  matrix.  Since  C  is  the  transpose  of  E,  Ci  is  simply  an  eigenvector 
written  as  a  row  vector.  Let  *Cj  be  the  i-th  eigenvector  corresponding  to  the  j-th  eigenvalue.  All  n 
eigenvectors  are  ordered  according  to  their  respective  eigenvalues  in  the  following  manner. 


(32) 


Every  proper  orthogonal  matrix  satisfies  a  differential  equation  of  the  form 

C  =  -QC 


(33) 
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0 

1*1^1 

for  Si  =  Sj 

for  1  ■  1  ^  i^mox 

where  is  a  skew-symmetric  matrix.  AU  eigenfactor  derivatives  of  M  are  expressed  by  a  projection 
onto  Cl, . . .  ,c„  in  terms  of  mj  .  t 

The  distinct  elements  of  the  fl  matrix  elements  are**’’ 


(35) 


where  fima*  is  a  maximum  bound  for  the  n  matrix  entries  depending  on  the  accmacy  of  the  software 
used.  This  term  is  included  to  smoothly  handle  the  case  where  Aj  is  almost  equal  to  Aj.  Ref.  4  shows 
that  this  sUght  modification  has  minimal  impact  on  the  accuracy  of  the  solution.  i 
the  eigenvector  variations  corresponding  to  the  clustered  eigenvalues  have  neghgible  mfiuence  on  the 

diagonalization.  of  M. 

The  time  derivative  of  the  eigenvalues  A,-  are^*^ 

A.-  =  p..  (36) 

However,  the  time  derivatives  of  the  eigenvalues  are  not  required,  but  the  derivative  of  the  squ^e 
root  of  the  eigenvalues.  Let  s.-  be  the  i-th  entry  of  the  5  matrix.  Using  the  cham  rule,  the  denvative 

of  5i  is  -  ^  ^ 

k-  -  —A-  (37 

“  2s.-  ’ 


This  is  written  in  a  more  compact  form  using  the  diagonal  matrix  F 

r  =  diag{nii) 


as^ 


j  =  lrs-' 


(38) 

(39) 


Substituting  the  above  eigenfactor  time  derivatives  into  Eq.  (27),  the  Boltzmann-Hamel  equations 
are  reduced  to 


fl  +  S-^  (nS+^TS-^-CJ’^-C^  C^5-^^»7  =  5 


(40) 


At  first  glance,  Eq.  (40)  may  seem  more  complicated  than  than  the  original  equations  of  motion. 
Keep  in  mind,  however,  that  5  and  F  are  diagonal  matrices  which  greatly  reduces  the  computational 

burden. 

To  be  able  to  calculate  the  sensitivity  matrix  Jj  we  stiU  need  an  expression  for  dS/dxk  ^d 
dC/dxk.  Note  that  in  the  previous  development  of  S  and  C  it  did  not  matter  with  respect  to  what 
variable  the  derivative  was  taken.  This  allows  dS/dxk  and  dC/dxk  to  be  expressed  in  a  very  similar 
manner  as  were  S  and  C.  Let  be  defined  as 

t  dMix,t)  T 


(41) 


and  the  diagonal  matrix  ^'F  be 

‘f  =  diag  {‘‘pa) 

The  partial  derivative  of  5  with  respect  to  Xk  has  the  same  form  as  the  time  denvative  of  5  in 
Eq.  /eqrefdSl  as  oc  , 

=  s*r.s-‘ 

dxk  2 


(42) 
S  in 

(43) 
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To  find  BCfdxk,  let  the  skew-symmetric  matrix  be  defined  as 

‘  [ny]  = 


for 

4^ 

0 

for  Si  =  Sj 

^  ^maiSign 

for 

‘a  - 

Sj  s. 

(44) 


>fi« 


dC/dxk  is  then  defined  analogously  to  the  time  derivative  of  Eq.  (33)  as 


dXk 


Using  Eqs.  (43)  and  (45)  dnidxk  can  be  written  as 

.  =  7? 


dr] 

dXk 


(45) 


(46) 


PFAFFIAN  NON-HOLONOMIC  CONSTRAINTS 

If  the  dynamical  system  is  unconstrained,  then  the  Pfaffian  constraint  matra  B  be  zero  and 
Eq  (40)  iJfully  defined.  However,  if  the  dynamics  axe  constrained  through  the  Pf^^ 

Turfa^e  Wen  in  Eq.  (8),  then  Eq.  (40)  will  need  to  be  solved  simultaneously  with  the  constramt 
equation  Using  Eq.  (19)  we  rewrite  the  Pfaffian  constraint  in  terms  of  the  new  velocity  vector  tj. 

AC'^S-^T]  +  b  =  0 

which  can  be  simplified  using  Eq.  (28)  to 

577-1-6  =  0 

The  dynamic  constraint  equations  is  obtained  by  taking  the  first  time  derivative  of  Eq.  (48) 

BT]  +  Br)  +  b  =  0 

Using  Eqs.  (39),  (33)  and  (28)  B  can  be  expressed  as 


(47) 

(48) 

(49) 


B  =  (AC^  +  -  AC^5-‘S)S-^  =  (AC^  +  AC^(0  -  \s  S  ^ 


(50) 


To  determine  (77,  A),  Eq.  (40)  will  need  to  be  solved  simultaneously  with  Eq.  (49),  we  are  led  to  the 
differential-algebrEdc  system 


which  C3n  be  written  in  more  compact  form  as 


(52) 


Since  5  is  a  mxn  matrix,  ^2  is  a  symmetric  (7i-F7n)x(7i-l-m)  matrix.  A  partitioned  matrix 
inversion  formula^  is  used  to  find  the  inverse  of  M2.  Because  of  the  use  of  the  qu^-coor^ates 
T],  the  upper  left  partition  of  M2  is  a  Tixn  identity  matrix  whidi  smplifies  the  partitioned  mverse 
immensely.  For  this  case  the  mxTn  Schur  complement  A  reduces  to 

A  =  55’’ 
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A-^B 


J5^A-n 

-A-^  J 


Then  the  partitioned  inverse  of  M2  is 

Using  in  Eq.  (54)  the  constrained  differential  equation  of  motion  for  rj  is 

,)  =  (/-  B'^A-^B)ai  +  B'^A-^at 

The  Lagrange  constraint  vector  A  is 

A  =  A~^Bai  -  A“*a2 


(54) 


(55) 


(56) 


Note  that  if  zero  constraint  is  imposed  on  the  dynamical  system  then  Eq.  (55)  collapses  back  to 
Eq.  (40).  If  the  number  of  system  constrdnts  m  is  small,  then  A"  could  be  inverted  for  eaA  tune 
step.  However,  as  m  grows  larger  taking  a  numerical  inverse  quidcly  becomes  computationally  very 

expensive. 

Since  A,  for  linearly  independent  constraints,  is  a  positive  definite  symmetric  matrix  by  Eq.  (53), 
it  can  be  decomposed  using  the  eigenfactor  parameterization  analogous  to  the  mass  matrix  parame¬ 
terization.  Let  Ci,  be  the  transpose  of  the  eigenvector  matrix  of  A,  and  let  Sa  be  a  diagonal  matrix 
whose  entries  are  the  positive  roots  of  A  eigenvalues.  Then  through  a  spectral  decomposition  A  can 

be  written  as  ^  ^ 

A  =  CI5I5aC'a  (57) 

Since  Ca  is  an  orthogonal  matrix  and  the  diagonal  entries  of  5a  are  all  positive,  the  inverse  of  A  is 

A-'  =  ClSZ^C^  (58) 

This  direct  inverse  formulation  reduces  Eq.  (55)  to  the  following  matrix  inverse  free  formulation. 

,)  =  (/-  B-^ClS^^C^B)a,  +  B'^ClS^^C^a2  (59) 

Keep  in  mind  that  5a  is  a  diagonal  matrix  with  positive  entries.  Therefor  finding  its  inverse  involves 
only  scalar  inversions. 

To  update  the  Ca  and  5a  matrices  without  resolving  the  eigenvalue,  eigenvector  problem,  then 
time  derivatives  are  found  using  the  square  root  eigenfactor  algorithm^  analogously  to  finding  the 
time  derivatives  of  C  and  5  of  the  mass  matrix  M.  Assume  all  eigenvectors  and  eigenvalues  are 
arranged  as  described  in  Eq.  (30)  and  (32).  Let  ca.-  be  the  i-th  row  vector  of  the  Ca  matrix,  then 
Bit  is  defined  be  ^  ^ 

=  (50) 

where  the  time  derivative  of  A  is  ,  ,  _  ^ 

A  =  BB'^  +  BB^  (61) 

and  B  was  defined  in  Eq.  (50).  The  diagonal  matrix  Ta  and  the  skew-symmetric  matrix  JIa  are 
then  defined  as 

Ta  =  diagm  (52) 


(flA  J  = 


0ii 

0 


for 

The  time  derivatives  of  Ca  aad  5a  are  then  written  as^ 

5a  =  |rA5^‘ 
Ca  “  ““J^aCa 


(63) 


(64) 

(65) 


SOLUTION  METHOD  OUTLINE 

A  method  has  been  presented  that  brings  a  general  class  of  constrained  multi-body  dynami^  to 
a  form  which  completely  avoids  the  necessity  of  inverting  configuration-variable  matrices  to  obtam 
instantaneous  accelerations.  The  form  of  the  equations  is  very  analogous  to  classical  dynam- 
ics/kinematics”  quasi-coordinate  development  of  rigid  body  dynamics.  The  eigenvalue  eigenvector 
problem  is  only  solved  once  numerically  to  find  the  initial  5(fo)»  C{to),  5a (to)  and  CA(to)  matrices 
as  outlined  in  Figure  1.  After  initially  ordering  the  eigenvalues  and  eigenvectors  as  outhned  earher 
they  will  need  need  to  be  simply  rearranged  thanks  to  the  square  root  algorithm.  Instead  of  usmg 
the  generalized  coordinate  derivative  i  as  the  velocity  measure,  a  new  quasi-velocity  t]  is  introduced 
to  which  corresponds  an  identity  mass  matrix. 

To  evaluate  the  eigenfactor  derivatives  it  is  assumed  that  M{x,t)  and  dMldxk{x,  t)  are  available 
algebraicaUy.  This  is  a  feasible  assumption,  especially  in  view  of  the  several  modem  softw^e  packages 
like  Maple  and  Mathematica  which  can  derive  the  mass  matrix  in  an  explicit  alpbraic  form  and 
automate  the  generation  of,  for  example,  the  C-code  to  compute  M(x,t)  and  dM{x,t)/dxk. 


Figure  1  Flow  Diagram  of  Eigenfactor  Square  Root  Algorithm 

For  a  constrained  dynamical  system,  traditional  processes  lead  to  the  classical  Lagrange  equations 
of  motion  coupled  to  second  order  differential  constraint  equations  where  a  time  and  configuration 
varying  mass  matrix  needed  to  be  inverted.  In  the  present  development,  there  are  no  matrix  inverse 
operations.  These  equations  are  mapped  into  a  set  of  simpler  nonlinear  first  order  differential 
equations.  The  second  order  differential  equation  for  x  is  replaced  with  two  first  order  differential 
equations  17  and  i.  The  mass  matrix  inverse  problem  is  side-stepped  by  introducing  the  mass  matrix 
eigenfactor  matrices  and  solving  their  usually  well-behaved  differential  equations  for  S  and  C  instead. 
This  method  has  no  second  coupled  constraint  equation,  since  the  constraint  force  was  already  solved 
for  and  bacJc-substituted  into  the  equation  of  motion  for  77.  However,  this  involved  taking  the  inverse 
of  a  symmetric  Schur  matrix  A.  This  inverse  can  also  be  avoided  very  simply  by  using  the  eigenfactor 
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matrices  of  the  Schur  matrix  instead  of  the  Schur  matrix  itself.  Therefor,  again  a  matrix  inverse  is 
replaced  by  solving  two  first  order  differential  equations  and  Ca*  Evaluation  of  operation  count 
and  error  propagation  issues  shows  solving  these  differential  equations  to  be  vastly  superior  to  the 
conventional  approach  requiring  matrix  inversions. 

To  solve  the  above  first  order  differential  equations  many  types  of  integration  methods  could  be 
used.  However,  the  Runge-Kutta  type  methods  are  not  attractive  since  they  require  the  derivatives 
to  be  evaluated  at  discrete  point  between  the  time  steps.  This  poses  a  problem  when  evaluating  J 
at  these  intermediate  steps  since  it  depends  on  dS/dx  and  dC/dx.  It  would  reqtiire  resolving  the 
eigenvalue  eigenvector  problem  at  these  intermediate  steps  to  find  the  proper  J  matrix.  Clearly  not 
a  desirable  solution. 

In  lieu  of  using  Rimge-Kutta  or  analogous  single-step  methods,  it  is  recommended  that  a  predictor- 
corrector  type  method  is  used.  These  methods  only  evaluate  the  derivatives  at  the  integration  steps 
and  not  in  between  them.  A  very  stable  and  accurate  predictor  corrector  type  method  is  the 
Hamming’s  method.®  Its  accuracy  is  A®,  comparable  to  the  4-th  order  Rimge  Kutta  method.  One 
drawback  of  the  predictor  corrector  method  is  that  they  are  not  self  starting.  Another  method,  such 
as  the  modified  Euler  method,®  can  be  used  to  establish  the  starting  table. 

DUAL-LINK  MANIPULATOR  SIMULATION 

To  demonstrate  the  eigenfactor  square  root  algorithm,  a  constrained  dual-link  mampulator  motion 
is  simulated.  The  shoulder  joint  is  fixed  and  the  elbow  joint  is  free  to  rotate.  The  link  from  the 
shoulder  to  the  elbow  has  a  length  /i  =  0.5  and  the  link  from  the  elbow  to  the  hand  has  a  length 
I2  =  I/V2.  Both  links  are  assumed  to  be  mass-less.  The  elbow  mass  and  the  hand  mass  are 
mi  =  m2  =  1.  The  hand  is  connected  to  a  point  (0,4)  through  a  spring  with  a  stiffness  if  =  1.  The 
system  constraint  restricts  the  hand  to  move  only  horizontally  as  illustrated  in  Fig.  2.  There  are  no 
non-conservative  forces  or  torques  acting  on  the  system. 


The  hand  coordinates  (a:,y)  are  given  as 

X  =  /i  cos  $1  -f  I2  cos  62  (66) 

y  =  li  sin0i  + 12  sin02  (67) 

The  system  potential  energy  is  the  total  spring  energy  given  as 

1 


The  system  kinetic  energy  is  pven  as 


T  =  \m1ll0l  +  ^012  (liOl  +  2/il2  cos  (01  —  02)^1  ^2  +  ^2®l) 


Prom  the  kinetic  energy  T  the  system  mass  matrix  can  be  extracted. 

n4-/a\  r  ("*i  +  '^2)^1  1712/1/2  cos  (01  —  02) 

'  '  [ 7712/1/2  cos  (01  —  62)  7n2/2 


The  dgenfactor  square  root  algorithm  requires  an  algebraic  expression  for  M  and  dM/dOk.  They 
are  found  directly  from  the  system  mass  matrix  M  in  Eq.  (70). 

•V//.  r  0  m2/i/2sin(0i  —  t/ie/a2)(02  —  ^i)l 

(  ’  ^  “  [7712/1/2  sin (01  —  theta2){02  —  ^1)  0  J 

5Af  _  r  0  — m2/i/2sin(0i  —  02)1  7^2) 

“  [-7712/1/2  sin  (01 -02)  0  J 

dM  0  7712/1/2  sin  (01  —  02)1  7731 

“  [7712/1/2  sin  (01 -02)  0  J 

The  system  constraint  on  m2  is  y  =  0.  Using  Eq.  (67)  this  can  be  expressed  as  .4(0)0  =  0  where 

A(0)  =  [  /i  cos  01  /2  cos  02  ]  (74) 

The  simulation  is  started  at  rest  with  0i  =  0®  and  02  =  60®  and  let  run  for  10  seconds.  The 
integration  step  size  is  0.001  seconds.  The  resulting  motion  is  shown  in  Fig.  3  below. 


-1  -0.75  -0.5  -0.25 
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Figure  3  Dual-Link  Manipulator  Motion 

Clearly  the  Pfaffian  constradnt  was  successfully  incorporated  into  the  equations  of  motion.  The 
hand  only  moved  in  a  horizontal  manner.  Not  having  to  solve  auxihairy  constr^nt  equations  is 
of  great  importance  as  the  ntimber  of  constraints  increases.  Since  this  is  a  very  simple  dynamical 
system,  an  exact  inverse  was  found  of  the  system  mass  matrix  and  used  to  forward  integrate  the 
classical  Lagrange  equations  of  motion  to  verify  the  new  equations  of  motion.  The  results  were 
identical  to  solving  the  {fj,  x)  dynamical  system. 

One  critical  case  of  the  eigenfactor  square  root  algorithm  is  when  two  or  more  eigenvalues  are 
clustered  very  closely  around  one  value.  In  this  case  elements  of  ft  could  go  to  infinity.  This  case 
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Figure  4  Time  History  of  the  Eigenvalue  Square  Roots 


was  resolving  by  putting  an  maximum  bound  on  the  magnitude  of  the  eigenvalue  square  roo^. 
bound  is  usually  set  to  machine  accuracy  (i.e.  10^®  for  this  simulation).  As  can  be  swn  in  Fig;  (4) 
the  two  eigenvalue  square  roots  start  out  distinct  and  periodically  become  equal.  The  condition 
s,  =  S2  means  geometrically  that  l^i  -  62]  is  90°  or  that  the  lower  arm  is  pei^endicul^  to  the 
upper  arm.  The  eigenfactor  square  root  algorithm  did  not  appear  to  have  any  handling 

this  numerical  singularity.  Not  even  after  repeatably  going  through  this  condition.  These  r^^ts 
seem  to  confirm  some  of  the  robustness  predictions  made  in  Ref.  4  about  the  square  root  algorithm. 


Figure  5  Time  History  of  the  Total  Energy  Integration  Error 


Since  all  the  forces  and  torques  acting  on  the  dual-link  manipulator  are  conservative,  the  total 
system  energy  should  be  constant.  This  makes  the  total  energy  a  good  integration  error  che(i  and 
is  shown  in  Fig.  5.  Since  the  motion  starts  out  at  rest,  the  integrations  remains  very  small  initially. 
As  the  motion  gains  momentum,  the  integration  error  starts  to  accumulate  very  slowly.  The  forward 
integration  was  performed  with  only  performing  the  predictor  and  corrector  process  once.  For  the 
same  step  size  the  error  could  be  further  reduced  by  repeatably  applying  the  P-C  method  during  the 
forward  integration.  This  is  possible  since  P-C  methods  allow  the  integration  error  to  be  estimated 
during  the  forward  integration. 

CONCLUSION 

The  eigenfactor  square  root  algorithm  can  successfully  solve  a  very  large  class  of  nonlinear  dy¬ 
namical  systems.  The  classical  second  order  Lagrange  equation  of  motion  is  replaced  with  two  first 
order  differential  equations  by  introducing  a  new  quasi- velocity  tj.  Pfaffian  constr^nts  can  be  ac¬ 
counted  for  directly  in  the  new  equations  of  motion.  Constraint  equations  no  longer  need  to  be 
solved  simultaneously  with  the  dynamical  equation  thus  greatly  reducing  the  computational  burden. 
Any  inverse  of  a  symmetric  positive  definite  matrix  such  as  the  mass  matrix  is  replaced  with  the 
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problem  of  solving  the  respective  eigenvalue  and  eigenvector  matrix  &st  t^^^ 

Numerical  simulations  for  a  dual-link  manipulator  confirm  the  vahdity  of  the  method.  Using  the 
square  root  algorithm  for  solving  the  eigenfactor  differential  equations  appears  to  be  very  robust 
even  when  some  eigenvalues  are  clustered  closely  together. 


“Methods  of  Applied  Dynamics,”  NASA  Reference  PubU- 


REFERENCES 

1.  M.  H.  Rheinfurth  and  H.  B.  Wilson  “ 
cation  1262,  NASA,  May  1991. 

2.  A.  Jain  and  G.  Rodrigue2.  “Diagonalized  Lagrangian Robot  Dynamics,”  IEEE  Trans,  on  Robotics 
and  Automation,  Vol.  11,  No.  4,  Aug.  1995,  pp.  571—584. 

3.  E.  Bayo  and R.  Ledesma.  “Augmented Lagranian  and 

Constrained  Multibody  Dynamics,”  Journal  of  Nonlinear  Dynamics,  Vol.  9, 1996,  pp.  113 

4.  Y.  Oshman  and  I.  Y.  Bar-Itzhack.  “Eigenfactor  Solution  of  the  Matrix  Ric^ti  Eq'iation  -  A 
Continuous  Square  Root  Algorithm,”  IEEE  Trans,  on  Automatic  Control,  Vol.  AC-30,  No.  10, 


Oct.  1985,  pp.  971-978. 

5.  J.  G.  Papastavridis.  “On  the  Boltzmann-Hamel  Equations  of  Motion:  A  Vectorial  Treatment, 
Journal  of  Applied  Mechanics,  Vol.  61,  June  1994,  pp.  453-459. 

6.  J.  L.  Junkins  and  J.  D.  Turner.  Optimal  Spacecraft  Rotational  Maneuvers.  Elsevier  Science 


Publishers,  Netherlands,  1986. 

7.  J.  L.  Junkins  and  Y.  Kim.  Introduction  to  Dynamics  and  Control  of  Flexible  Structures.  AIAA 
Education  Series,  Washington  D.C.,  1993. 

8.  M.  L.  James,  G.  M.  Smith,  and  J.  C.  Wolford.  Applied  Numerical  Methods  for  Digital  Computing. 
Harper  k  Row,  Publishers  Inc.,  New  York,  3rd  edition,  1985. 


13 


221 


AAS  95-417 


GLOBALLY  STABLE  FEEDBACK  LAWS  FOR 
NEAR-MINIWIUM-FUEL  AND  NEAR-MINIMUM-TIME  POINTING 
MANEUVERS  FOR  A  LANDMARK-TRACKING  SPACECRAFT 


Hanspeter  Schaub*,  Rush  D.  Robinett^  and  John  L.  Junkins 

Utilizing  unique  properties  of  a  recently  developed  set 

ters  the  n^ifi^  Rodrigues  parameters,  a  feedforward/feedback 

Sntrol  laws  is  developed  for  a  spacecraft  undergoing  '^f9®  ^nline^ 

tions  using  three  reaction  wheels.  The  me^od  is  suitable  9 

given  reference  trajectories  that  spline  smoothly  into  a 

fSS  traiector^  may  be  exact  or  approximate  solutions  of  me 

system  equations  of  motion.  An  associated  asymptotically  stable  n^n- 

£r  obsenrer  is  formulated  for  state  estimation.  In  partcular  we 

the  Ideas  usng  both  near-minimum-time  and  near-m|nifnum 

about  Euler's  principal  rotation  axis,  with  pararneterization  of  the  ste^ 

ness  of  the  control  switching  for  each  class  of 

punov  stability  theory  is  used  to  prove  rigorous  dobal  asyrnptohc  st^^ 
of  the  cIosed-Io(^  motion  In  the  end  game  and  '®9  the  trackirig  o^e 

SSlavTfor^aVlIJo^^^ 

sharpness  of  all  control  switches,  to  enhance  the  kackability  of  the  r^r 
enc6  maneuvers  in  the  presence  of  structural  flexibility. 

INTRODUCTION  „-  r 

Motivated  by  problems  arising  in  the  precision  pointing  of  imaging  ^telbl^  for 
non-proliferation  and  environmental  monitoring  applications,  there  is  renewed  in  ^pro^ 

lem  of  rapid  large  angle  maneuvers  followed  by  precision  pointing/tracking  of  bndm^from 
near-earth  oibits.  Pointing  and  tracking  tolerances  for  these  imaging  systems  are  on  the  order  of 
microradians.  There  are  many  contributors  to  pointing  error,  but  the  vibrauonal  drsmrb^i^ 
duced  by  the  effects  of  rapid  maneuvers  on  flexible  solar  array  structures  are  one  ' 

In  previous  studies^'^  it  has  been  shown  that,  assuming  sufficient  sensor  and  actnaior  bandwidA, 
reaction  wheel  actuators  can  effectively  control  both  the  rigid  body  maneuver  ^d 
fine-pointing/vibration  arrest;  however,  the  key  issue  is  to  perform  the 
torque-shaped  fashion  that  minimizes  disturbances  of  the  flexural  mouon.  Judiaoos  torque  sha^ 
ing  must  be  coupled  with  stabilizing  feedback  control  to  null  tracking  ^d  f 
Ste  approach  ported  hercio.  Wc  ro  extend  the  developments  of  Ref.  U  to  ^ 

bally  asymptotically  stable  nonlinear  control  design  approach  of  broad  applicanflity  to  general 
three-dimensional  pointing  and  tracking  problems. 

*  Graduate  Research  Assistart,  Aerospace  Engineering  Department.  Texas  A&M  University.  Coliege  Sslion  TX  77843. 

t  Research  Engineer  atSandia  Natio«al  Uboratories.  Albuquerque.  NM  87185. 

t  George  Eppright  Chair.  Professor  of  Aerospace  Engineering.  Aerospace  Engineering  Department.  Texas  AiM  Unive  ty. 
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In  recent  papers^^,  the  utility  of  a  new  set  of  orientation  parameters  (the  modified  Rodrigues 
parameters,  MRPs)  has  been  studied.  It  has  been  shown  that  these  parameters  have  soine  outstand¬ 
ing  properties.  They  appear  to  be  the  canonical  three  parameter  set,  owing  to  the  following  remark¬ 
able  truths: 

•  The  nonsingular  motion  range  encompasses  i3^  degrees,  although  the  norm  of  the 

tend  to  infinity  as  ±360  degrees  rotations  about  any  axis  is  approached. 

•  For  rotations  within  ±180  degrees  about  any  axis,  the  parameters  are  bounded  by  a 
norm  of +1. 

•  The  differential  equations  are  quadratic  nonlinear  functions  of  the  MRPs, 

and  have  no  singular  points  for  rotations  less  than  ±360  degrees. 

•  The  transformation  from  orthogonal  components  of  angular  velocity  to  the  de¬ 
rivatives  of  the  MRPs  involves  a  coefficient  matrix  with  orthogonal  rows  and  col¬ 
umns,  thus  the  inverse  is  analytic. 

•  The  MRPs  are  non  unique,  there  are  two  trajectories  corresponding  exactly  to  a 
given  physical  motion.  One  of  the  trajectories  at  any  instant  of  time  lies  w^m  and 
the  other  lies  outside  a  unit  sphere.  Both  trajectories  satisfy  the  same  differential 
equations,  only  differing  in  initial  conditions. 

Regarding  the  last  property,  it  is  easy  to  establish  the  transformation  between  the  correspond¬ 
ing  points  on  the  two  trajectories,  and  this  fact  can  be  utilized  to  establish,  for  the  first  time,  a  glo¬ 
bally  nonsingular  three  parameter  description  of  a  generally  tumbling  rigid  body. 

These  properties,  together  with  recent  results  from  Lyapunov  control  law  design  methods  , 
enable  the  formulation  of  a  most  attractive  and  effective  faniily  of  control  laws  for  spacwr^t  atu- 
tude  maneuvers  and  fine  pointing.  The  control  law  design  methodology  is  important  in  its  own 
right,  as  distinct  fiom  die  use  of  the  MRPs  as  orientation  coordinates.  In  particular,  however,  this 
control  law  design  approach  is  especially  attractive  for  this  coordinate  choice.  The  feedback  law  is 
dominated  by  linear  tenns  for  this  approach  with  a  judicious  choice  of  a  logarithmic  Lyapunov 
function^.  The  analytical  results  presented  herein  are  illustrated  through  a  simulation  study  which 
supports  the  efficacy  and  practicality  of  the  concepts  introduced. 


FORMULATION 


The  Equations  Of  Motion  For  A  Rigid  Spacecraft 

The  spacecraft  is  assumed  to  have  three  reaction  wheels  with  distinct  inertia  aligned  with  the 
three  body  axes  to  control  its  attitude.  Each  reaction  wheel  inertia  about  the  respective  spin  aius  is 
given  by/-  Let  the  inertia  matrix  3  contain  the  spacecraft  and  the  transverse  reaction  wheel  iner¬ 
tia  and  let  'the  matrix  /  be  defined  as 


/  = 


r/i  0  O-j 
0  /2  0 
-0  0  /3J 


(1) 


Let  C&B/iv  be  the  spacecraft  body  angular  velocity  vector  relative  to  an  inertial  frame  N  and  let 
the  vector  contain  tbe  angular  velocities  of  each  reaction  wheel.  The  rotational  equations  of 
motion  can  be  written  as^ 

S^^^=-[©fi/Ar]32>a>w  “[wa/v]/(i2  +  0£iyw)  -«+/  (2) 

at 

where  the  control  vector  u  also  satisfies  the  reaction  axial  wheel  equation  of  motion: 


(3) 


The  tilde  matrix  [©]  is  defined  as 


■  0  -(03  (02  1 

[©]=  (03  0  -(Oi  (4) 

l-(02  (0i  0  , 

and  the  vector  /  is  the  sum  of  all  external  to  ques  acting  on  the  spacecraft  These  torques  are 
in  part  due  to  aerodynamic  and  solar  radiation  drag  and  are  usually  considered  to  be  very  sm^ 
compared  to  the  internal  torques  being  applied.  They  are  assumed  to  have  a  known  bound  F 
which  is  defined  as  I/’/I  ^  F/ . 

Attitude  Coordinates 

4-9 

All  spacecraft  orientations  are  described  using  sets  of  modified  Rodrigues  parameters  . 
They  are  a  minimal  coordinate  representation  of  a  rigid  body  attitude  with  several  useful  attrib¬ 
utes.  They  can  be  defined  in  terms  of  the  Euler  parameters  (quaternions)  as 

--  i=  1,2,3 


1  +  Po 


or  in  terms  of  the  principal  rotation  axis  e  and  the  principal  rotation  angle  (j)  as 

d  =  e-tan<l)/4  (6) 

Obviously  they  go  singular  at  a  principal  rotation  of  ±360®  where  po  ~  1  •  What  makes 
this  set  very  attractive  is  that  this  singularity  can  be  completely  avoided  by  making  use  of  the  fact 
that  the  modified  Rodrigues  parameters  are  not  unique.  Notice  that  reversing  the  sign  of  the  p’s  in 
Eq.  (5)  generates  a  second  set  of  a’s.  The  alternate  set  is  called  the  ‘‘shadow  set’  ,  and  goes  singu¬ 
lar  at  zero  rotations  and  is  very  well  behaved  around  the  ±360®  rotations.  Hence,  if  a  singularity  is 
approached  with  the  original  set,  one  can  switch  the  attitude  description  to  the  “shadow  set”  and 
avoid  the  singularity  at  the  cost  of  having  a  discontinuity  at  the  switching  point.  The  transforma- 
tion  between  “original’*  and  “shadow”  set  is 

<jf=-0//a^6  j  =  1,2,3  (7) 

Keep  in  mind  that  the  choice  in  distinguishing  “original”  and  “shadow  sets  is  purely  arbitrary. 
Both  sets  describe  the  same  physical  orientation.  In  this  study  the  switching  condition  was  chosen 
to  be  =  1  .  This  causes  the  magnitude  of  the  orientation  vector  to  be  bounded  between 
0  <  |al  <  1  .  In  terms  of  a  principal  orientation  angle  this  means  that  the  angle  is  restricted  to  be 
within  - 180®  <  ^  <  +  180®  .  Note  that  this  combined  set  of  “original”  and  “shadow”  parame¬ 
ters  implicidy  “knows”  the  shortest  way  back  to  the  origin  .  Lengthy  principal  rotations  of  more 
than  180°  are  avoided.  This  will  be  useful  when  designing  a  robust  attitude  feedback  control  law. 
Also  note  from  Eq.  (6)  that  for  the  range  -  180®  <  <j)  <  +  180®  the  modified  Rodrigues  par^e- 
ters  behave  very  linearly.  The  differential  kinematic  equation  of  motion  in  terms  of  the  modified 
Rodrigues  parameters  is  given  below"**^.  Note  that  the  equation  only  contains  second  order  polyno¬ 
mial  nonlinearities  in  G . 


4M^) 


+  [d]  +  ad^ 


Eq.  (8)  holds  for  both  the  “original”  and  the  “shadow”  set.  This  means  that  the  derivative  is 
well  defined  even  at  the  switching  point.  The  direction  cosine  matrix  in  term  of  the  modified  Ro¬ 
drigues  parameters 

’4(o|  —  G2  —  Gj) +  8G1O2+4G3X  8G1O3  —  402X 

C(d)  = - - 2  8ai  02-4032  4(-Oi +O2 -O3) +  X^  802O3+401X 

(1  +  0^0)  [  80103+4022  80203+401E  4(-of-o^+o|)  +  x2j 

Z  =  1-6^0 


1 


(1  +  6^0) 


2?,4 


OPEN-LOOP  DYNAMICS 

Rest-to-Rest  Principal  Rotation  Reference  Maneuver 

Instead  of  doing  a  computationally  expensive  optimal  control,  all  maneuvers  performed  will 
be  about  Hat  principal  axis  of  rotation.  This  will  allow  real-time  pre-computation  of  the  reference 
maneuvers.  This  solution  is  close  to  the  optimal  solution  and  much  faster  to  compute.  Euler’s  prin¬ 
cipal  rotation  theorem  states  that  any  reference  frame  can  be  related  to  another  reference  frame 
through  a  single-axis  rotation.  This  theorem  allows  any  three-dimensional  rotation  to  be  viewed  as 
a  single-axis  rotation  about  die  principal  axis,  as  illustrated  by  the  simple  one~ditnensional  e<}ua- 
tion  shown  below. 

30  =  «  (10) 

While  certain  gyroscopic  coupling  nonlinearities  must  be  accounted  for,  since  the  actual  mo¬ 
tion  will  be  fully  three-dimenrional,  Eq.  (10)  provides  a  simple  approach  to  design  a  reference  tra¬ 
jectory.  Let  N  denote  the  inertial  and  R  denote  the  open-loop  reference  frames.  The  initial  and 
final  reference  attitude  can  be  established  by  the  initial  and  final  direction  cosine  matrices 
[RN(to)]  and  [RN{tf)]  inthe sense 

ritf)=lRN{tf)]nitf),  Wo)  =  [mto)Mto)  (lla,b) 

The  rotation  from  the  initial  to  the  final  position  of  the  body  axes  is  established  by  a  direction 
cosine  matrix  [RR{tf,  fo )] » where 

Wf)  =  [/?i?(//.#o)]r(^o).  [RRitfJo)]  =  lRN{tf)][RN(to)f  (12a,b) 

Euler’s  Principal  axis  of  rotation  is  determined  by  finding  the  eigenvector  of  [RR{tf,  ro)] 
which  corresponds  to  the  eigenvalue  +1;  that  is,  we  find  the  comjjonents  k  }  of  the  unit 

vector  satisfying 


[M(f/,ro)]|j2[  = 


(13) 


The  principal  rotation  an^e  Of  can  be  found  by  extracting  the  diagonal  elements  from  the 
[/?/?(//,  fo)]  matrix^.  We  limit  our  principal  rotation  angles  to  be  within  0®  <  9  ^  180®  , 
which  is  done  automatically  when  using  the  inverse  cosine  function  below. 

(14) 


The  principal  axis  of  relation  can  also  be  foundl,  except  near  the  zero  and  il80®  case,  from 
the  matrix  elements  of  [^^(^»fo)]  ♦ 


/  = 


2sin0/ 


RR23  —  RR32 ' 
RR^i  RR12  * 
.RR12  —  RRii  - 


(15) 


Taking  the  inverse  kinematics  viewpoint,  we  can  prescribe  a  reference  trajectory  0r(f)  as  a 
rotation  about  the  principal  vector  of  /o)]  •  For  the  reference  trajectory  to  conform  with 

the  desired  initial  and  final  attitude,  it  is  necessary  that  0r(O  satisfy  the  boundary  conditions 
9r(0)  =  0  and  Of (//)  =  0/ . 

Using  the  reference  principal  angle  Of  (/)  and  the  principal  axis  of  rotation  /  ,  we  can  define 
the  reference  orientation,  angular  velocity  and  angular  acceleration  as 


p(0  =  /  •  tan 


0r(0 


a)r(t)  =  iQr(t), 


d(br 


it)  =  fQrit) 


(16a,b,c) 


4  ’  ■  ■'  dt 

where  p(t)  is  a  modified  Rodrigues  parameter  vector  which  parameterizes  the  direction  co- 
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•  ..motriv  in's}  Given  the  above  reference  body  angular  velocity  and  acceleration  and 

L“S“oSiS.!lesT4re„ce  control 


g  =  _3  [cb^lSWr  -  [a)ry(a  +fiv) 

at 


(17) 


Near-Minimum-Time  Maneuver 

The  ODtimal  control  for  a  rigid  body  minimum  time  maneuver  is  a  “bang-bang”  type  control. 
For  I^X^Ima^ler  through  a  principal  angle  0/  .  the  “bang-bang”  control  has  the  struc¬ 
ture:  _ 

/  tf\  /436/  _  /  49/  2 

2  )■  ® 

where  e;««  and  u^ax  are  one-dimensional  quantities  measured  along  the  principal  axis  of  ro- 

If  we  anticipate  that  the  “bang-bang”  control  wiU  excite  significant  vibration  of  the  flexible  de¬ 
grees  of  freedom,  it  is  easy  to  smooth  out  the  control  switches  using  cubic  splines^and  introduce 
“controllably  sharp”  torque  switches  using  the  smoothed  “bang-bang  control  shape  . 

(ifhHi))' 

atf  <t<^-cttf  =  ti 


0r(O  “  Q/noxi 


1. 


(18) 


-1. 


tz  <t<tf-aif  =  tz 


where  a  controls  the  sharpness  of  the  switches.  a  =  0  generates  Ae  bang-bang  instanta¬ 
neous  torque  switches  and  a  =  0.25  generates  the  smoothest  m^ber  of  Ae  family.  After  c^ 
ing  out  the  double  integration,  the  final  maneuver  time  is  found  m  terms  of  Ae  pnncipal  angle  ro¬ 
tated  0/  ,  the  maximum  principal  angular  acceleration  0max  and  the  smoothing  factor  a. 

(19a,b) 


^  \/em«'l-2a+|a2 


6/mix  ““ 


t^max 

3 


The  resulting  principal  angles  and  angular  velocities  can  be  seen  in  1>. 

was  chosen.  Obviously  the  maximum  increase  of  maneuver  time  ( for  a  =0.25)  is  less  than  38%, 
compared  to  the  “bang-bang”  (  a  =  0  )  case.  For  a  flexible  spacecraft,  due  to  the  d^re^e  m^- 
brational  energy,  the  actual  maneuver  time  (including  vibration  settling  ume)  is  typically  d  - 
creased  significanfly  by  using  the  smoothed  “bang-bang”  control  Even  though  we  are  not  si^ifi- 
cally  considering  the  flexible  spacecraft  case  at  this  point,  we  can  implicitly  cqnsidw  flexibility  by 
eliminating  sharp  torque  switches  which  can  be  anticipated  to  “ring”  the  s^ctore.  Quahtative  y,  a 
sufficiently  smooth  and  low  amplitude  torque  history  will  make  the  most  flexible  structure  behave 
more  like  a  rigid  structure  and  make  the  corresponding  reference  trajectory  more  trackable. 
These  statements  can  be  made  quite  rigorously,  see  for  example  .  For  weU-chosen  reference  ma¬ 
neuvers  and  uacking  law  design,  maneuver  times  for  flexible  spacecraft  can  usually  be  kept  within 
10  to  20%  of  the  theoretical  rigid  body  minimum  maneuver  times. 
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Figure  1  A  Sample  Torque  Shaped  Family  of  Near  “Bang-Bang”  Maneuvers 
Near-Minimum-Fuel  Maneuver 

The  torque  time  history  of  a  optimal  rigid  body  minimum-fuel  maneuver  consists  of  a  sharp 
initial  impulse  to  get  the  spacecraft  rotating,  a  long  coasting  period,  followed  by  a  sharp  reverse 
impulse  to  arrest  the  motion.  Naturally,  these  sharp  impulses  would  cause  havoc  for  a  highly  flex¬ 
ible  structure.  Therefore  a  smoothed  “bang-off-bang”  control  is  chosen  similar  to  the 
near-minimum-time  maneuver  presented  previously. 


0<r<ai// 

aitf  <t^aitf  +  a2tf = h 
ti  <t^2a\tf  +  a.2tf  =  t2 

t2<t<tf-  ICLytf  -r  Oitf  s  t3 

a.itj  =  u 

U  <t<tf-a^tf=ts 
ts  ^t<tf 


(20) 


The  instantaneous  control  switches  are  replaced  by  cubic  splines  with  the  rise  and  decay  shape 
having  controlled  sharpness.  Hence  two  torque  smoothing  factors  ai  and  0.2  are  used.  TTie  fac- 
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tor  ai  detennines  the  rise  or  fall  time  from  or  to  the  maximum  torque  to  zero  torque  as  a  percent¬ 
age  of  the  total  maneuver  time.  The  factor  a2  determines  how  long  maximuni  torque  is  applied, 
also  as  a  fraction  of  the  total  maneuver  time.  The  amount  of  fuel  used  is  chosen  implicitly  by  spec¬ 
ifying  the  two  parameters  tti  and  a2  • 

The  total  maneuver  time  for  the  smoothed  “bang-off-bang”  control  is  found  again  by  twice  in¬ 
tegrating  the  one  dimensional  principal  rotation  equation. 


\j'Qmax' OL\+ 0.2 -2a} -Saiai-a^’ 


40/ 


Q/nm  ^ 


^max 


(21a, b) 


The  sample  time  history  of  principal  angular  acceleration,  velocity  and  the  principal  angle  for 
a  smoothed  “bang-off-bang”  control  is  shown  in  Figure  2,  where  CX[  =  Ct2  =0.1  were  chosen. 
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Figure  2  A  Sample  Torque-Shaped  Family  of  Near  “Bang-Off-Bang”  Maneuvers 


Incorporating  Angular  Velocity  At  The  Final  Maneuver  Time 

The  principal  rotation  maneuver  presented  only  applies  to  a  rest-to-rest  maneuver.  To  track  a 
landmark,  it  is  desired  that  the  body  have  a  certain  angular  velocity  ©(t/)  at  the  end  of  the  ma¬ 
neuver.  This  allows  the  spacecraft  to  keep  the  sensors  pointing  toward  a  location  on  Earth  for  a  fi¬ 
nite  duration  of  time  and  essentially  achieve  gross  “motion  compensation"  for  smear-free  imaging. 
To  accomplish  this,  the  reference  motion  will  be  described  relative  to  a  moving  target  frame,  not 
the  inertial  frame.  Three  coordinate  frames  are  used: 

R:  open-loop  reference  coordinate  axes  ( or  follows  the  desired  trajectory) 

T:  target  motion  coordinate  frame 
N:  inertial  coordinate  frame 


Let  ©r/jv  be  the  body  angular  velocity  vector  of  the  target  frame.  In  order  to  match  up  with 
our  desired  motion,  the  target  frame  T  must  have  the  following  constraints. 

Sir/N  (0)  =  0  ^/N{tf)  =  ©(//)  (22a,b) 


[riV(t/)]  =  [iy/(r/)]  (23) 

Since  the  rest-to-rest  principal  rotation  is  described  relative  to  the  T  6ame,  these  conations  in¬ 
sure  that  the  actual  reference  motion  will  have  zero  inertial  angular  velocity  at  r=0,  and  the  desired 
orientation  and  angular  velocity  at  the  maneuver  end. 

Besides  these  three  conditions  any  target  motion  can  be  chosen.  ^  The  target  modoo  used  in  ^s 
study  was  chosen  to  be  a  pure  spin  rotation  about  the  axis,  since  an  analytic  solution  exists 

for  this  trajectory.  The  orientation  of  the  T  frame  at  any  time  t  is  given  as 

[TN(t)]  =  [7T(f,  tf)][TN{tf)]  (24) 


where  the  matrix  [7T(f.  t/)]  describes  the  pure  spin  motion  away  the  fin^  target  posi¬ 
tion.  Let  the  modified  Rodrigues  parameter  vector  pj  parameterize  the  [7T(f,  t/ )  j  matra  with 
the  condition  that  PrC?/)  =  0  .  The  unit  vector  h  is  the  principal  axis  of  the  target  motion  and 
is  defined  as 


It  = 


g>((r) 

|fi>((r)l 


(25) 


and  Gj"  is  the  target  principal  rotation  angle.  The  target  motion  pfif)  is  then  defined  as 

Pr(0  =  h  •  tan^  (26) 


where  0r(jy)  =  0  .  To  match  initial  and  final  conditions  of  the  target  angular  velocity  a 
cubic  spline  was  used.  By  choice,  this  will  result  in  the  reference  motion  having  no  angular  accel¬ 
eration  at  the  maneuver  end,  but  this  is  not  a  requirement  of  the  method  itself.  Any  target  angular 
velocity  history  that  matches  the  conditions  in  Eqs.  (22a,b)  could  have  been  used.  The  target  angu¬ 
lar  velocity  and  acceleration  are  defined  as: 

^/^(O  =  |Sb7Ar(t/)l|^| 

ddyr/Nit)  _ !%///(//)!  [gl_  J  .  t  (28) 

dt  tf  \  tf  \tf]  I 

After  once  integrating  Eq.  (27)  the  target  principal  rotation  angle  is  found. 

The  relative  position  of  the  reference  frame  to  the  target  frame  is  given  by  the  matrix  [/^(t)] 
which  is  found  through 

[Rnt)]  =  [mt)][TNit)f  (30) 


At  the  times  to  and  tf  the  relative  orientations  are  defined  as 

[Rnto)]  =  [RNitomTNito)f  (31) 


[/?r(r/)l  =  [/yV(r/)][77V(rf)f  =/  (32) 


Eq.  (12b)  is  now  rewritten  as 


[;?/?(r/,fo)i  =  [Rntf)][mto)f  =  iRnto)f  os) 

The  matrix  [/?/?(//,  /o  )]  defined  in  Eq.  (33)  is  used  to  define  the  rest-to-rest  princij^  rotation 
motion  for  the  case  where  the  reference  motion  is  supposed  to  have  a  final  angular  velocity. 

Given  the  maneuver  time  //  ,  we  would  be  able  to  accui^ely  (Ascribe  the  complete  target  mo¬ 
tion.  To  find  tf  though,  we  need  to  know  the  first,  which  itself  tfcpends  on 

the  target  motion.  Since  we  only  know  the  final,  not  the  initial  target  position  in  advance,  no 
closed  form  solution  is  available  to  find  1/  .  An  iterative  method  was  used  to  find  the  m^euver 
time.  The  initial  estimate  for  tr  was  found  by  assuming  complete  rest-to-rest  motion.  Using  this 
tf  a  new  to)]  matrix  was  found  and  with  it  znew  tf  .  This  method  conveiged  very 

quickly  if  half  of  the  difference  between  old  and  new  tf  was  added  to  the  old  t-  . 

The  matrix  [i?r(0]  is  given  as 

[RTiO]  =  [RR{t,  ro)l[l?T(ro)]  (34) 

where  the  [RT{to)]  matrix  was  defined  in  Eq.  (31).  The  desired  reference  motion  relative  to 
the  inertial  frame  is  found  from  Eq.  (30)  to  be 

[RNit)]  =  [Rnt)][TN{t)]  (35) 


where  the  target  motion  [riV(0]  is  gi''sn  in  Eq.  (24). 

The  angular  velocity  and  acceleration  expressed  in  Eq.  (16b,c)  are  now  expressed  relative  to 
the  target  frame  motion.  Hence,  let  us  relabel  these  quantities  as  expressions  relative  to  the  target 


frame  as 


0)^(0  =  S)r(t)  . 


^^/r(0  _  (0 

dt  dt 


(36) 


where  the  superscripts  indicate  in  which  coordinate  frame  the  vectors  are  written.  The  refer¬ 
ence  angular  velocity  expressed  relative  to  die  inemal  frame  is  given  as 

To  find  the  reference  angular  acceleration  relative  to  the  inertial  frame,  the  inertial  derivative 
of  Eq.(37)  is  taken. 


(38) 


For  the  limiting  case  where  the  target  fiame  has  zero  motion,  Eqs.  (37)  and  (38)  collapse  back 
to  the  rest-to-rest  case  given  in  Eqs.  (16b,c), 


CLOSED-LOOP  DYNAMICS 


Lyapunov  Method  To  Design  Nonlinear  Tracking  Control  Law 

A  nonlinear  tracking  control  law  is  de\’eloped  to  assure  that  the  reference  trajectory  is  asym^ 
totically  tracked.  One  advantage  of  this  nonlinear  control  law  over  other  control  laws  is  that  it  is 
globally,  asymptotically  stabilizing!  The  control  law  has  inherently  no  restriedons  on  the  size  of 
the  attitude  or  the  angular  velocity  error.  Secondly,  through  the  choice  of  the  aidtude  coordinates, 
this  control  law  will  bring  a  body,  which  has  tumbled  beyond  il80^  from  the  reference  motion, 
back  to  the  reference  trajectory  through  the  shortest  angular  distance.  The  three  coordinate  frames 
used  are: 

B :  actual  spacecraft  coordinate  frame 


R;  reference  coordinate  axes 
N:  inertial  coordinate  frame 

Let  the  [BR]  matrix  define  the  relative  attitude  of  the  spacecraft  to  the  reference  frame.  It  is  re¬ 
lated  to  [BiViftjl  as 

[BR]  =  [BN][RNf  (39) 


Let  the  modified  Rodrigues  parameter  vector  6  parameterize  tte  direction  cosine  matrix 
[BR].  This  vector  defines  the  orientation  error  of  the  spacecraft  relathe  to  the  reference  fr^e, 
achieving  6  ->  0  assumes  asymptotic  tracking  of  the  reference  motioa  The  cxtradiuon  of  the  a 
vector  from  the  [BR]  matrix  is  easily  accomplished  by  use  of  the  Po  parameter.  The  com¬ 
plete  transformation  is  given  below.  _ _ 

2Po  =+^trace(lBR])  +  l 


Oi 


02 


BR23  ~  BR32 
4po(l  +  Po) 
Bf?3i  —  BRi3 
4po(l  +  Po) 


(40) 


BR12  ~  BR2\ 

4po(l  +  Po) 

By  assuring  that  Po  ^  0  we  are  guaranteed  to  have  a  modified  Rodrigues  vector  with 
lal  ^  1  .  By  using  the  modified  Rodrigues  parameters  to  describe  the  error  in  orientation,  the 
feedback  control  law  will  inherently  know  the  “shortest  way”  back  to  tie  reference  frame.  As  ^ 
example,  if  the  spacecraft  has  rotated  a  principal  rotation  of  +200°  off  nim  the  reference  condi¬ 
tion,  the  control  law  will  know  to  let  the  spacecraft  complete  the  rotation.  It  will  perform  a  +160 
principal  rotation  instead  of  a  -200°  maneuver,  bringing  the  spacecraft  bick  to  the  reference  state 
“the  short  way  round”^. 

Obviously,  it  is  desired  to  make  the  body  frame  track  the  reference  ftame,  and  thus  the  objec¬ 
tive  of  the  tracking  control  law  should  be  to  make  any  departure  motion  C  vanish.  Let  all  the  fol¬ 
lowing  vectors  be  written  in  the  body  frame  B,  unless  noted  otherwise.  The  error  in  body  angular 
velocity  is  given  as 

5e>  =  SiB/N  —  [BR]S:l^  ('^^) 


The  reference  body  angular  velocrty  vector  must  be  transferred  into  m  ^y  frame,  since  it  is 
only  given  in  the  reference  frame  R.  The  error  in  body  angular  acceleration  is  found  by  taking  the 
derivative  of  Eq.  (41). 

-[BR]4:i(iiR/Nf  +  [(^b/n]IBRIS^^  (42) 

at  at  ct 

The  Lyapunov  function  for  the  feedback  control  law  is  defined  to  be 

V=i5S^35ai  +  2Rlog(l  +  d’"d)  (43) 

where  ^  is  a  scalar  gain  for  the  altitude  error  feedback.  Using  the  Icsantfam  of  the  departure 
motion  will  result  in  a  feedback  control  law  which  is  linear  in  C  .  As^siotras  points  out  in  Ref. 
5,  this  remarkable  fact  occurs  becanse  d/d/(21og^l +d  d))  =  5g)  a  .To  guarantee  global 
asymptotic  stability,  let  us  verify  that  the  first  time  derivative  of  V  is  negafve  definite. 

V  =  6a^34-(Soj)^+/i:-5#o  (44) 

dt 


Substituting  Eqs.  (42)  and  (2)  into  Eq.  (44)  yields 


V'  =  Sa^(-  [a>B/N  ]3wb/v  -  [&b/n  -  «  +/ 

at  ' 

After  defining  the  control  torque  vector  u  to  be 

5*  =-3(t«J!)|««)'' -  (aj/«][B/iiaV) 

-  Wn  ]3  -  Him  ly(£i" + sS,»  ) + ra"  +  P^‘ + f 


where  F  is  defined  as^® 

Fi  =  Fi-  sgn(5oj/ )  j  =  1 , 2, 3  (47) 

and  the  matrix  ?  is  a  positive  definite  angular  velocity  feedback  matrix,  and  substituting  u 
into  Eq.  (45),  is  shown  to  be  negative  definite. 

V  =  -5cb^P5ffl-6cb^(F-/)  <0  V8d3,d?iO  (48) 


For  clarity,  all  vectors  were  labeled  with  their  corresponding  coordinate  frame  in  (46). 
The  control  torque  given  above  is  dominated  by  Unear  terms  in  the  position  error  a  and  the  angu¬ 
lar  velocity  error  5o  .  It  guarantees  global  asymptotic  stability  during  both  the  tracbng  and  the 
end  game  phase,  assuming,  of  course,  negUgible  model  errors  and  perfect  state  measurements. 
Proper  gain  selection  will  result  in  a  good  rejection  of  model  and  external  disturbance  errors. 

Because  of  the  sgn  function  in  F  this  control  law  could  cause  some  chattering  if  the  angular 
velocity  measurements  are  noisy.  If  the  magnitude  of  F  is  small  enough  though,  this  should  not 
pose  any  practical  problems.  Having  the  F  term  in  the  control  law  does  guarantee  asymptotic  con¬ 
vergence  of  the  states  to  the  target  motion,  even  with  unknown  external  forces  present. 


Control  Feedback  Gain  Selection 

Assuming  zero  external  torques,  the  closed-loop  dynamics  are  found  by  substimting  Eqs,  (2) 
and  (42)  into  Eq.  (46).  Hie  resulting  differential  equation  only  depends  on  the  attitude  error  C 
and  the  body  angular  velocity  error  5©  . 


— (55))^  =  -  a:  •  3"*  a  -  3"^  P6oj  (49) 

dt 

Note  that  the  differential  equation  for  5©  is  linear  without  maldng  any  approxiinations.  The 
nonlinearity  of  the  closed-loop  dynamics  come  in  through  the  coupling  with  d  .  If  a  —  0  ,  then 
the  poles  of  Eq.  (49)  could  be  arbitrarily  chosen.  The  differential  equation  for  G  depends  quadrati- 

cally  on  d  and  is  given  by: 


dd 

It 


+  [d]  +  CG^ 


(50) 


After  linearizing  Eq.  (50)  about  5  =  0 ,  the  following  approximation  is  obtained 

(51) 

dt  4 

Remember  that  the  modihed  Rodrigues  parameters  act  like  Mgles  over  four.  This  fact  is  vis-  ^ 

ible  again  in  the  above  approximation.  Because  of  this,  the  linearization  using  modified  Ro¬ 
drigues  parameters  wll  be  valid  for  twice  the  rotation  range  compared  to  the  classical  Rodrigues 
parameters,  and  four  times  the  range  over  the  most  attractive  set  of  Euler  angles.  After  combining 
Eqs.  (49)  and  (51),  the  following  closed-loop  system  equations  of  motion  are  found: 


11 


232 


(52) 


Given  an  arbitrary  inertia  matrix  3  ,  a  root-locus  method  could  be  used  to  find  the  poles  of 
Eq.  (52).  The  roots  caimot  be  placed  arbitrarily  because  K  is  only  a  scalar  gain.  If  the  inertia  ma¬ 
trix  3  and  the  angular  velocity  feedback  matrix  P  are  chosen  to  be  diagonal  matrices,  then  Eq. 
(52)  can  be  decoupled  into  three  sets  of  two  equations 


i  =  1,2,3 


(53) 


whose  roots  can  be  solved  explicitly  as 


Note  that  the  only  approximations  made  in  the  above  analysis  are  Ae  linearization  of  Eq.  (50) 
and  Ae  assumption  of  a  diagonal  inertia  matrix  3  .  Since  Ae  linearization  of  Ae  modified  R<> 
drigues  parameters  are  valid  for  four  times  Ae  rotational  range  of  Ae  Euler  angles,  and  Ae  off  di¬ 
agonal  terms  m  Ae  inertia  matrix  are  usually  very  small  compared  to  Ae  diagonal  terms,  this  line¬ 
arization  will  typically  predict  Ae  dynamics  of  Ae  nonlinear  system  for  moderately  large  tracking 
errors. 

Figure  3  shows  Ae  root-locus  plot  of  Eq.  (54).  A  separate  can  be  chosen  for  each  body 


Figure  3  Root-Locus  Plot  of  Ae  Decoupled,  Linearized  Error  Dynamics 

Assuming  that  Ae  closed-loop  dynamics  will  be  slightly  under-damped,  we  can  write  Ae  angu¬ 
lar  velocity  feedback  gains  pi  in  term  of  Ae  controller  decay  time  constants  Tc  • 

K  =  23,^  i=  1.2,3  (55) 

The  scalar  aAtude  feedback  gain  K  is  still  free  to  be  chosen.  For  Ae  close-loop  dynamics  to 
be  under-damped,  the  condition  on  K  is 


(56) 


K>4-  i=  1.2.3 

Note  that  both  K  and  pi  determine  whether  the  closed-loop  dynamics  are  over-,  critically-,  or 
under-damped.  But  if  the  system  is  under-damped,  then  only  Pi  determines  how  fast  a  state  error 
will  decay.  On  the  other  hand,  the  gain  K  influences  the  frequency  of  the  oscillations  cOc,  • 


Control  Gain  Scheduling 

To  avoid  reaction  wheel  torque  saturation,  the  feedback  gains  are  lowered  whenever  the 
system  motion  error  is  too  large.  We  suggest  a  simple  heuristic  for  gain  scheduling,  which  c^  be 
sophisticated  as  necessary.  The  total  system  error  is  calculated  as  a  weighted  sum  of  the  attitude 
and  angular  velocity  error  vectors. 

error  =  |5fi)l  +  K  •  lo]  (58) 

If  this  measure  of  tracking  error  exceeds  some  nominal  value,  the  gains  are  lowered  to  some 
smaller  values.  Whenever  the  error  is  within  the  nominal  value,  the  gains  are  then  rmsed  agmn  to 
their  original  values.  This  assumes  only  two  sets  of  gains,  obviously  more  than  two  sets  could  be 

used. 

The  body  angular  velocity  feedback  gain  matrix  P  can  also  be  permitted  to  vary  with  time 
without  any  loss  of  stability  of  the  control  law  given  in  Eq.  (46).  The  only  requirement  is  that  P  re¬ 
mains  positive  definite.  The  attitude  feedback  gain  K,  however,  w^  considered  to  be  constant  dur¬ 
ing  the  stability  study.  Allowing  K  to  vary  in  time,  Eq.  (44)  is  rewritten  as 

V  =  6(0^  (s  ^  (56))'^  +  /To)  +  /:21og(  1  +  o^o)  =  -  +  A:21og(  I +6^  a)  (59) 

If  ^  is  changed  from  a  high  gain  to  a  low  gain,  (i.e.  a  large  system  error  is  present),  K  is  nega- 
tive  and  stability  is  still  guaranteed  during  the  transition  phase.  Only  if  is  changed  from 
gain  to  a  high  gain,  where  it  >  0  ,  is  stability  possibly  not  guaranteed.  If  ii:  is  large  enough,  V 
could  become  positive.  However,  since  the  transition  wiU  occur  oyer  a  finite  period  of  time,  over¬ 
all  stability  is  not  compromised.  Also,  the  maximum  positive  K  is  computable  at  any  time  to 
satisfy  V  <  tolerance  as 

tolerance  +  5©^P6fi) 

21og(l+d^o) 

Obviously  instantaneous  jumps  in  feedback  gains  should  be  avoided,  because  they  would 
cause  excessive  ringing  of  the  flexible  structure.  To  control  the  smoothness  of  the  feedback  gains 
time  history,  a  digital  low-pass  filter  is  added.  Any  jumps  in  feedback  gains  are  thus  filtered  out  to 
a  smooth  curve  with  a  controllable  rise  of  K . 


STATE  ESTIMATION 

The  purpose  of  this  nonlinear  estimator  is  to  cancel  any  measurements  errors  iii  the^body  atri- 
tude  vector  q  (given  in  modified  Rodrigues  parameters)  and  the  body  angular  velocity  (h  ,  even  in 
the  presence  of  an  unmodeled  external  torque  f  and  a  gyro  rate  bias  b  ^  Let  the  measured  states 
be  denoted  as  Xm  » ihe  estimated  states  as  X^st  the  actual  states  as  X . 


Qm 

Qest 

C5m 

il 

J 

COfxr 

a 

bfjt  J 

-best  - 

-b- 

(61) 
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The  rate  gyro  bias  b  is  assumed  to  be  constant  for  small  time  intervals,  thus  having  the  follow¬ 
ing  kinematic  equation 

4(f)  =0  (62) 


dt' 

Let  the  estimator  error  be  defined  as 


=[ll] 

LAh-l 


From  Eqs.  (2,8,62),  the  actual  s>^m  dynamics  can  be  written  as 

d 


dt 


(X)=F(X) 


r  0  1  ro-i 

-  S-^ii  +  d 

L  0  -1  LoJ 


(63) 


(64) 


where  the  Ff)  function  contains  the  dynamical  system.  The  angular  acceleration  d  due  to  the 
unmodeled  external  torques  is  defined  as 

5  =  3-7  (65) 

and  is  assumed  to  have  a  known  bound  D  satisfying  D/  ^  5/  .  If  the  bounds  of  the  rate  gyro 
bias  error  A5  and  of  the  angular  acceleration  due  to  external  forces  d  are  known,  then  the  follow¬ 
ing  dynamics  of  the  estimated  state  scan  be  shown  to  be  asymptotically  stable  for  arbitrary  large 
estimated  state  attitude  and  angular  velocity  errors. 

{Ea^  f  rO 

(66) 


-  r  ^  1 

^  r  0  1 

(  . 

V[ 

X/n  “  I  bis: 

—  ff  I  Xga  ~Xin  + 

best 

£a 

\  LoJ 

J  L  0  J 

Lo  J 

V 

0  JJ 

The  estimator  feedback  gain  matrix  H  is  positive  definite  and  partitioned  as 

r/fn  Hyi  H,3-| 

/=  H22  Hn 

-till  Hit  Hii. 

Similarly  to  Eq.  (47)  of  the  feedback  control  law,  the  vectors  and  Eq  are  defined  as 

[£5]-=  m2.x(abs([Hi2A5,mu]/))*sgn(Aqf)  (67) 

[£a],.  =  max(afas  ([flzzAhmax]; )  +  A)  *  sgn(Aa),)  (68) 

The  asymptotic  stability  of  Eq.  (66)  is  fuoven  with  the  Lyapunov  function 

(69) 


V=-e^e 

2 


Let  the  measured  states  be  broken  up  into  the  true  states,  the  random  white  noise  v  and  the 
rate  bias  components. 


X„=X  +  v  + 


rO-i 

b 

LoJ 


(70) 


By  enforcing  the  asymptotic  stability  requirement  V  <  0  and  by  making  use  of  Eqs.  (63),  (64) 
and  (66),  the  following  asymptotic  stability  condition  is  found. 

0 


^X  +  V  -  I^mJI  -  HU)  +  +  ^12^) 

-  Aa^(£a  +5  +  H22A5)  <  i^He 


(71) 
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Note  that  since  H  is  positive  definite,  the  right-hand  side  (RHS)  of  Eq.  (71)  will  always  be 
greater  than  zero  for  e  9^  0  .  Assuming  there  is  no  measurement  noise,  no  rate  gyro  bias  no 
unmodeled  external  torques,  than  the  estimator  dynamics  in  Eq.  (66)  “ 
stable.  We  offer  the  following  qualitative  observations  regarding  tumng  of  the  esUmator. 

If  an  unmodeled  external  angular  acceleration  d  is  present  with  a  known  bound  D  ,  then  the 
estowr  since  U.e.A^’^  tenn  of  Ae  left-hand  side  (LHS)  is 

to  b^  negative  definite  by  the  definition  of  Eg  .  Stability  is  still  guaranteed  for  any  positive  defi¬ 
nite  H  and  any  estimated  attitude  and  angular  velocity  errors. 

If  a  rate  bias  B  is  introduced  m*  a  bounded  etror  .  then  H  can  no  longer 
smalL  The  first  term  of  Ae  LHS  could  be  positive.  The  esttator  feedback  ^ 

chosen  large  enough  such  that  fUe  is  always  larger  than  Ae  first  ^  of  the  LHS.  The  s^nA 
third  and  fourA  term  of  Ae  LHS  are  guaranteed  to  be  neganve  deftniK  by  Ae  definiaon  of  E, 
and  Ed  .  and  because  H33  is  positive  definite. 

Once  white  measurement  noise  is  introduced,  the  estimated  smtes  will  not  converge  to  Ae  ac¬ 
tual  states  of  course,  but  will  oscillate  about  Aem.  While  doing  discrete  sapling  of  Ae  stMes  at 
At  intervals,  the  dominant  noise  term  of  Ae  estimator  dynamics  is  .  The  actual  jump  due  to 
^ise  from  one  sample  to  anoAer  is  bounded  by  Hv„ax^  .  To.furAer  adjust 
tics,  Ae  sampling  time  interval  can  be  Aned.  The  measurement  noise  also  has  a  second  degrading 
effect.  It  may  cause  Ae  sgn  functions  in  Eqs.  (67,68)  to  retam  an  incorrect  sign  of  A?,-  ^d  ACO/ . 
This  will  cause  a  secondary  noise  induced  effect  of  Ae  estimated  states  between  samples,  o 
order  of  EgAt  and  EaAt  respecdvely.  Again  Ae  filtering  errors  are  controlled  by  choosing  Ae 

sampling  interval. 

Under-  and  over-damped  estimator  djmamics  were  compared.  For  a  given  d^ay  time  con¬ 
stant.  Ae  over-damped  system  was  better  able  to  cancel  measurement  noise  than  Ae  under¬ 
damped  system.  To  assure  Aat  all  Ae  attitude  and  angular  velocity  measurement  errors  decay  at 
Ae  same  rate,  Ae  estimator  feedback  matrix  H  was  chosen  to  be  of  diagonal  form. 


Hcst 

0 
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I  0 
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0  1 
0 

hA 


(72) 


Writing  Ae  estimator  feedback  gain  Rest  A  terms  of  an  estimator  error  decay  time  constant 
we  get 

b2 


(66) 


The  estimator  feedback  gain  Hg  can  have  a  much  larger  decay  time  instant  than  Rest  . 
smce  Ae  rate  gyro  bias  is  assumed  to  change  very  slowly.  Having  a  small  «g  helps  in  reducing 
Ae  secondary  noise  effect  for  Ae  rate  gyro  bias  estimation.  In  practice,  we  may  use  Ae  above  esti¬ 
mation  algorithm  to  baseline  a  Kalman-Filter,  or  oAer  linear  state  algonthm,  appropnate  tor 
real-time  on  board  implementation. 

RESULTS 

The  following  figures  show  the  results  of  rigid  body  rotation  simulation.  The  body  inema  ma¬ 
trix  3  has  only  diagonal  entries  of  200  kgm^,  200  kgm^  and  1 18  kgm  corresponding  to  the  first, 
second  and  Aird  body  axis.  The  spacecraft  has  three  reaction  wheels  Signed  with  the  b^ody  axis 
whose  inertia  about  Ae  rotation  axis  are  0.00955  kgm^,  0.1240  kgm  and  0.00955  kgm  reyc- 
tively.  The  maneuver  takes  the  spacecraft  (m  3-2-1  Euler  angles)  from  (-4  ,-55  ,4  )  to 
(4°,55°,-4°).  The  rotation  is  mainly  about  Ae  pitch  axis  with  some  slight  yawing  and  r(3lling.  1  he 
craft  st^  out  wiA  zero  angular  velocity  and  is  required  to  have  a  final  angular  velocity  of  -1  /s 
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about  the  pitch  axis  at  the  end  of  the  maneuver.  The  error  in  initial  attitude  and  angular  velocity  is 
(-0.05°,0.8“.0.05°)  and  (-0.025°/s,0.1°/s,0.0257s). 

The  feedback  control  law  was  chosen  to  have  a  time  constant  of  4  seconds  and  an  attitude 
feedback  gain  K  of  44.  This  results  in  the  feedback  response  in  the  pitch  and  yaw  axis  fciving  a 
damped  frequency  of  9.05  ®/s,  and  the  roll  axis  having  d^ped  frequency  of  14.4  °/s.  The  estima¬ 
tor  time  constant  T-  was  set  to  be  0.4  seconds,  an  order  of  magnitude  faster  than  T^.  The  initial  es¬ 
timated  3-2-1  Euler  angles  were  (-4.1V55.5®,3.95*).  The  attitude  noise  measurements  were  sub¬ 
jected  to  random  noise  of  the  magnitude  of  4e-5  (given  in  MRP).  The  initial  estimated  body  angu¬ 
lar  velocities  were  (-0.027s, 0.157s, 0.037s).  The  angular  velocity  measurement  noise  level  was 
set  to  5e-5  7s. 


Figure  4  Open-  and  Closed-Loop  Attitude  for  2nd  Body  Axis 

The  total  maneuver  time  was  104.09  seconds.  Figures  4  and  5  show  the  attitude  time  history 
in  MRP  space.  The  closed-loop  motion  accurately  tracks  the  open-loop  trajectory.  Figure  4  shows 
the  large  pitching  maneuver.  Since  a  final  negative  angular  velocity  is  required  about  the  2nd  body 
axis,  the  craft  has  to  rotate  beyond  the  target  attitude  and  return  to  it  with  the  desired  angular  veloc¬ 
ity.  The  open-loop  maneuver  designed  in  this  paper  performs  this  task  in  a  very  smooth  and 
near-optimum  fashion. 


Figure  5  Open-  and  Closed-Loop  Attitude  for  1st  and  3rd  Body  Axis 

Figures  6  and  7  show  the  time  history  of  the  angular  velocities.  The  open-loop  maneuver  cor¬ 
rectly  ends  with  a  zero  angular  velocity  about  the  1st  and  3rd  body  axis,  and  with  -l7s  about  the 
second  body  axis  with  no  angular  acceleration.  If  a  final  angular  acceleration  is  required,  this 
could  easily  be  incorporated  into  the  target  trajectory  used  to  generate  the  open-loop  motion. 


The  initial  state  errors  are  canceled  by  the  feedback  control  law  and  the  open-loop  trajectory  is 
tracked  accurately. 


j - -j - 1 - 


3.001 

i 

2.00; 

1 

1 

1 

o 

•o 

1.00; 

0.00; 

S 

-1.00; 

1 

1 

i 

1 

I 

-2.00- 

— 1 — 1 — r 

I 


{ 


I  I  I 

I  i  I 

_ I - 1 — 

I  I  I 


axis  2  (open  loop) 


- axis  2  (closed  loop) 


!  I  I 

I  i  } 

H —  —  —  r- - 

I  I  I 

_ i— 

I  I  I 

j - J 

I 
I 

hH- 
I 
I 


I 

\ 

_L-- 

I 


I 

-I — 
! 
i 


0  10  20  30 


I  I  1  .  1  I  I- 

40  50  60  70 

time  [s] 


80  90  100  no 


Figure  6  Open-  and  Closed-Loop  Body  Angular  Velocity  for  2nd  Body  Axis 


Figure  7  Open-  and  Closed-Loop  Body  Angular  Velocity  for  1st  and  3rd  Body  Axis 

Figures  8  and  9  show  the  time  history  of  the  internal  control  torque  exerted  onto  the  three  reac¬ 
tion  wheels.  The  maximum  torque  encountered  is  0.3108  Nm  by  the  second  reaction  wheel.  The  ^ 

measurement  noise  is  not  visible  in  Figure  4  because  of  the  relatively  high  torques.  The 
closed-loop  time  history  appears  smooth  and  asymptotically  approaches  the  open-loop  torque  time 
history. 
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Figure  8  Open-  and  Closed-LoopControl  Torque  for  2nd  Reaction  Wheel 
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The  measurement  noise  is  visible  in  the  time  histories  of  the  1st  and  3rd  reaction  wheek  since 
they  are  only  exerting  relatively  low  torques.  But  even  here  the  noise  is  small  compared  to  the 
torques  and  does  not  pose  any  fine  pointing  problems.  The  closed-loop  Ume  history  sull  asymp  o  - 
ically  approaches  the  open-loop  control  torque. 
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Figure  9  Open-  and  Closed-Loop  Control  Torque  for  1st  and  3rd  Reaction  Wheels 

Figure  10  shows  the  time  history  of  the  attitude  tracking  error  between  the  estimated  states 
and  the  open-loop  states.  The  Unearization  used  to  find  the  controller  feedback  ve^  accu¬ 
rately  models  the  actual  nonlinear  feedback  dynamics.  The  decay  ttme  consmts  and  Ae 
frequencies  match  with  the  simulation  very  well.  As  predicted,  the  1st  axis  has  a  higher  damped 
frequency  than  the  2nd  and  3rd  axis. 


Figure  10  Closed-Loop  Attitude  Tracking  Error 

Figure  1 1  shows  the  time  history  of  the  angular  velocity  tracking  error.  Similar  observation 
as  with  the  attitude  tracking  error  can  be  made.  In  both  cases  the  initial  state  error  is  asymptoti¬ 
cally  canceled.  The  error  is  effectively  gone  after  about  20  seconds.  The  measurement  noise  lev- 
els  are  too  low  to  be  visible  on  these  figures. 
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Figure  11  aosed-Loop  Body  Angular  Velocity  Tracking  Error 
Figures  12  and  13  show  Ae 

mated  states  and  the  actual  ^  ^ics  are  over-damped  and  errors  decay  an  order 
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CONCLUSIONS 

A  nonlinear  feedback  control  approach  has  been  developed  for  iirge  three-dimensional  rota¬ 
tional  maneuvers.  A  unique  coordinate  choice  and  the  use  of  Lyapmov  conttol  d^i^  ’ll  ^  ^ 
are  the  key  new  ingredients  blended  to  produce  these  results.  To  avdd  exce  jsive  ringing  of  the 
structure,  the  near-niinimum-time  and  near-minimum-fuel  reference  control  torques  were 
smoothed  with  cubic  splines. 

The  feedforward/feedback  control  law  presented  is  globally  asynptotically  stable,  even  under 
the  influence  of  unmodeled,  external  torques  with  a  known  bound.  The  nonlinear  esumator  h^ 
proven  Lyapunov  st^ility,  and  asymptotic  stability  in  the  absence  of  measurement  noise.  It  is 
also  able  to  compensate  for  unmodeled  external  torques  and  rate  gyro  biases. 

The  actual  closed-loop  controller  and  estimator  feedback  dynarics  matched  veiy  well  with 
the  dynamics  predicted  in  the  feedback  gain  selection  sections,  since  only  the  attimde  dynamics 
had  to  be  linearized.  Because  of  the  choice  of  attitude  coordinates,  th;  modified  Rodrigues  param¬ 
eters,  this  linearization  is  valid  for  a  range  of  attitude  errors  four  timis  larger  than  if  Euler  angles 
were'used,  and  two  times  larger  than  if  the  classical  Rodrigues  paramiiers  were  used. 

The  maneuver  demonstrated  was  able  to  track  the  open-loop  trajereory  asymptotically  and  can¬ 
cel  any  initial  state  or  estimator  errors. 
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Analytical  expi«ssions  are  developed  for  computing  eigenvector  derivatives,  specialized  for  the  case  of  me¬ 
chanical  second-order  dynamic  systems.  Both  exact  and  approximate  formulations  are  developed  using  a  modal 
approach.  The  new  exact  formulations  are  found  to  be  numericaUy  accurate  and  to  require  signifi^tly 
less  computing  time  than  the  corresponding  generalized  formulations.  An  improved  approximate  method  is  also 
introduced  for  computing  a  truncated  set  of  eigenvector  derivatives  for  large  structural  systems.  Numerical  ex¬ 
amples  are  included  to  evaluate  the  effectiveness  of  the  approximate  formulations,  and  they  are  found  to  be  very 
efficient  in  the  cases  studied. 


L  Introduction 

For  many  analysis  and  design  problems  in  engineering  system 
analysis,  including  applications  such  as  identification  of  dy¬ 
namic  systems,^*^  redesign  of  vibratory  systenw,^"^  and  design  of 
control  systems  by  pole  placement,^“'^  it  is  widely  known  in  the 
engineering  literature  that  eigenvalue  and  eigenvector  derivatives 
with  respect  to  design  parameters  are  useful. 

In  the  past  20  years,  several  algebraic  methods  for  computing 
eigenvector  derivatives  have  been  studied  by  many  researchers.*^”*^ 
Nelson*^  has  proposed  an  algebraic  method  for  computing  eigen¬ 
vector  derivatives.  In  this  formulation,  the  eigenvector  derivatives 
can  be  computed  using  only  the  eigenvector  of  interest  together 
vrith  some  algebraic  manipulation.  F6x  and  Kapoor*^  present  ex¬ 
pressions  for  the  rates  of  change  of  eigenvalues  and  eigenvectors 
with  respect  to  the  design  parameters  of  the  structure'  Recently, 
Lim  et  al.*^  re-examined  this  problem  and  provided  a  new  formula¬ 
tion  for  computing  eigenvector  derivatives  and  also  established  im¬ 
portant  relationships  between  left  and  right  eigenvector  derivatives. 
Dailey*®  presents  an  algorithm  for  computing  eigenvector  deriva¬ 
tives  for  real  symmetric  matrices  in  the  case  of  repeated  eigen¬ 
values.  Improved  approximate  methods  for  eigenvector  derivatives, 
using  only  an  available  subset  of  mode  shapes,  are  presented*®**^ 
for  extremely  large  systems.  All  the  above  formulations  are  derived 
for  the  general  non-self-adjoint  systems  under  the  assumption  that 
matrices,  eigenvalues,  and  eigenvectors  are  differentiable,  except  at 
isolated  points;  most  applications  reported  have  been  to  mechanical 
dynamic  systems. 

It  is  widely  known  that  the  dynamics  of  a  large  class  of  mechani¬ 
cal  systems  can  be  represented  most  naturally  by  second-order  sys¬ 
tems  of  differential  equations  with  several  special  properties.  For 
applying  optimization  or  iterative  design  ideas  to  these  systems,  the 
second-order  differential  equations  are  usually  transformed  into  a 
higher  dimensioned  first-order  state  space.  Since  the  dimension  of 
aerospace  structural  dynamic  systems  is  usually  large,  one  often 
encounters  uncomfortably  high  computational  burden  to  compute 
eigenvector  derivatives  using  any  of  the  available  formulations.  Note 
that  eigenvector  derivatives  are  central  features  for  many  algorithms 
utilizing  iterative  methods  that  modify  the  eigenstructure,  and  the 
computation  time  per  iteration  is  very  important. 
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There  exist  several  properties  of  the  system  matrices  describ¬ 
ing  mechanical  systems  that  we  exploit  in  the  present  paper  to 
significantly  reduce  the  computational  burden.  In  this  paper  efficient 
formulas  for  computing  eigenvector  derivatives  for  a  large  family  of 
mechanical  second-order  systems  are  derived  by  eliminating  some 
unnecessary  steps  that  are  associated  with  transforming  the  differ¬ 
ential  equations  into  a  first-order  state  space  and  applying  general- 
purpose  algorithms.  Note  that  Fox  and  Kapoor’s  formulation*^ 
reflects  structural  characteristics  instead  of  treating  general  eigen¬ 
value  problems.  Therefore  it  can  be  easily  applied  for  optimum 
design  of  structures.  However,  damping  characteristics  are  not  con¬ 
sidered  in  their  formulation,  and  therefore  it  is  basically  a  special 
case  of  Lim  et  al.  formulation.*^  In  other  words.  Fox  and  Kapoor’s 
formulation*^  cannot  be  applied  for  both  the  control  law  design 
problem  and  the  structural  optimization  problem,  since,  in  the  gen- 
.  eral  setting,  both  of  these  problems  include  artificial  or  aerodynamic 
damping.  Since  our  formulation  includes  linear  damping  character¬ 
istics,  it  can  be  utilized  for  solving  a  large  class  of  optimization 
problems  concerned  with  mechanical  second-order  systems.  A  nu¬ 
merical  study  is  included  to  evaluate  the  effectiveness  of  the  new 
formulations.  An  improved  method  for  approximating  a  truncated 
set  of  eigenvector  derivatives  for  large  structural  systems  is  also 
presented  and  its  utility  is  evaluated. 

n.  Eigenvalue  Problems  and  Modal  Derivatives 

Consider  a  linear  structure  (modeled  by  a  finite  element  or  similar 
discretization  scheme)  in  which  the  configuration  vector  x  is  gov¬ 
erned  by  the  system  of  linear  second-order  differential  equations 

Mx(t)  +  Ci:(f)  +  Kx{t)  =  Du{t)  (1) 

where  is  the  n  x  /i  positive-definite  symmetric  mass  matrix,  C 
is  the  /I  X  n  positive-semidefinite  symmetric  structural  damping  ma¬ 
trix  that  can  be  diagonalized  via  modal  coordinate  transformation, 
/C  is  the /I  X  n  positive-semidefinite  symmetric  stiffness  matrix,  and 
D  is  the  n  X  m  control  influence  matrix. 

The  closed-loop  system  can  be  written  as 

Mx{t)  +  Cx{t)  +  Kx(t)  =  0  (2) 

In  a  control  design  problem,  the  control  law  usually  feeds  back  po¬ 
sition  and  velocity  information,  and  mass  matrix  M  maintains  its 
constant,  symmetric,  positive-definite  characteristics,  but  the  damp¬ 
ing  and  stiffness  matrices  C,  K  will  be  changed  by  feedback  such 
that  the  open-loop  symmetry  and  definiteness  characteristics  are  not 
generally  guaranteed.  In  a  structural  optimization  problem,  all  ma¬ 
trices  will  most  generally  be  perturbed,  but  A/,  C,  K  will  maintain 
their  symmetry  and  definiteness  properties  over  all  admissible  de¬ 
signs.  In  all  cases  where  we  consider  eigenvector  derivatives  with 
respect  to  system  parameters  or  control  gains,  the  system  mairicc.s 
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Af ,  C,  AT  will  be  assumed  to  be  analytic  fiincdons  of  the  system 
design  parameters  or  control  gains.  . 

Generalized  Eigenvalue  Problem 

In  order  to  solve  eigenvalue  problems  for  mechamcal  second- 
order  systems,  Eq.  (2)  can  be  transformed  to  the  standard  first-order 
state-space  form 

/ 

or 

Bz^Az  (4) 

where 


"1- 

-1:1 

Lo 

AfJ* 

L-x  -cj 

Equation  (4)  represents  the  generalized  eigent^ue  problem  for  the 
given  system,  and  in  this  paper,  only  nondefective  systems  that  have 
a  set  of  n  linearly  independent  eigenvectors  will  be  considered 
We  observe  that  there  is  an  infinity  of  possibiliti^  iinplicit  m  the 
above  transformed  equations;  the  matrix  L  is  at  this  point  unspeci¬ 
fied.  For  selection  of  the  L  matrix,  we  must  consider  the  impact  of  the 
selection  of  L  upon  numerical  accura(^  and  efficiency  in  comput¬ 
ing  eigenvalues  and  eigenvectors;  a  symmetric  nonsinguto  matrix 
is  vridely  used  for  convenience.  In  the  structural  dynamics  literature, 
the  most  popular  choices  for  L  are  either  the  mass  matrix  Af  or  the 
stiffness  matrix  K.  If  the  system  includes  rigid-body  modes,  then 
the  K  matrix  will  be  singular  with  the  dimension  of  the  null  space 
being  the  number  of  rigid-body  degrees  of  freedom,  and  therefore, 
the  mass  matrix  Af  is  a  better  candidate  for  those  systems.  Note  that 
for  the  L  =  Af  case,  the  B  matrix  is  always  a  constant  positive- 
definite  symmetric  matrix  for  the  general  control  design  problem 
(assuming  the  control  law  utilizes  only  position  and  velocity  infor¬ 
mation  for  feedback).  On  the  other  hand,  for  the  L  =  AT  case,  the  B 
matrix  wiU  be  modified  during  the  stractural  optimization  process 
and  the  symmetric  property  is  generally  lost  due  to  feedbaclo  Note 
that  if  B  is  ill-conditioned,  then  this  can  rule  out  the  possibility  of 
computing  any  generalized  eigenvalue  accurately  (Ref.  18,  p.  395). 
Since  the  condition  number  of  a  matrk  provides  a  useful  measure 
of  numerical  accuracy  in  matrix  manipulations,  it  would  be  useful 
to  discuss  the  condition  of  the  B  matrix  for  the  selected  L  matrix 
briefly.  Our  experience  with  such  studies  indicates  that  the  condi¬ 
tion  number  of  the  B  matrix  for  the  L  =  M  choice  is  typically 
smaller  than  that  for  the  L  =  AT  case;  the  common  existence  of 
many  low-frequency  eigenvalues  is  associated  with  a  nearly  rank- 
deficient  stiffness  matrix.  This  practical  point  of  view  indicates  that 
constructing  the  B  matrix  using  L  =  Af  will  usually  lead  to  better 
conditioned  computations  and  more  accurate  numerical  results  than 
using  L  =  AT.  For  low-dimensioned  problems  with  no  rigid-body 
degrees  of  freedom,  the  condition  of  all  system  matrices  is  typically 
good,  and  therefore  the  stiffness  matrix  can  be  used  in  this  situation 
for  L  with  excellent  numerical  efficiency  and  also  without  degrad¬ 
ing  the  numerical  accuracy.  However,  for  large  structural  dynamics 
problems  with  rigid-body  modes  or  many  low-frequency  modes,  we 
recommend  choosing  L  as  the  mass  matrix,  as  a  rule  of  thumb,  for 
numerical  stability  and  accuracy. 

The  right  and  left  eigenvalue  problems  associated  with  z  =  <pe 
solutions  of  Eq.  (4)  are,  respectively, 

XiB<l>i=A<f>i  i  =  l,2 . 2n 

=  A^tl^i  j  =  1, 2, . . . ,  2n 

where  we  adopt  the  conventional  normalization  of  the  biorthogo¬ 
nality  conditions  for  the  eigenvectors  as 

</,fB</.,  =  l  «  =  1.2 . 2n 

if,]  B<f>i  =  Sij  i.;  =  1.2 . 2/1 


/.y  =  1.2 . n 


where  iY  denotes  the  transpose  of  the  given  vector.  It  is  possible 
that  the  above  normalization  er^tion  cannot  be  appH^  in  some 
circumstances,  because  it  occasionally  happens  that  4>i  B<f>i  my 
generate  a  zero  value.  However,  the  probability  of  encountering 
tMs  condition  can  be  reduced  to  essentially  zero  for  structural  dy¬ 
namics  applications  when  spedal  properties  of  admissible  matrices 
are  taken  into  account  Also,  note  that  with  normalization  equa¬ 
tion  (6)  the  normalized  eigenvectors  are  unique  within  a  sign;  -<t>i 
gives  the  same  information  as  It  is  apparent  that  a  consistent  and 
unique  eigenvector  can  be  obtained  by  considering  the  sign  of  any 
one  nonzero  element  of  each  eigenvector.  This  property  docs  not 
generate  any  problem,  if  any  formulation  (for  example,  eigenvector 
sensitivity)  utilizing  eigenvector  information  also  reflects  the  si^ 
of  the  corresponding  eigenvector,  consistently.  We  will  discuss  this 
further  in  the  subsequent  section. 


Eigenvalue  and  Eigenvector  Derivatives 

The  usefulness  of  eigenvalue  and  eigenvector  derivatives  in  de¬ 
sign  algorithms  for  engineering  system  analysis  is  well  loiown. 
Some  specific  applications  include  identification  of  dynamic  sys¬ 
tems,  redesign  of  vibratory  systems,  design  of  control  gains  by 
eigenstructurc  assignment,  and  sensor/actuator  placement  optimiza¬ 
tion,  In  order  to  apply  gradient-based  optimization  algorithms,  it  is 
usefol  to  compute  analytical  partial  derivatives  of  eigenvalues  and 
eigenvectors  with  respect  to  the  system  design  parameters. 

The  differentiability  of  the  cigenv^tors  has  been  addressed  in  the 
recent  literature,'^” and  most  of  the  papers  are  in  the  applications- 
driven  engineering  optimization  literature;  some  aspects  of  eigen¬ 
vector  differentiation  in  a  general  sense  have  been  addressed  in 
the  linear  algebra  literature'*”^;  however,  the  circumstances  un¬ 
der  which  eigenvectors  are  not  differentiable  docs  not  appear  to 
be  adequately  treated.  Therefore,  there  may  be  need  for  coUabo- 
ration  between  engineering  community  and  applied  linear  algebra  . 
researchers  to  address  the  problem  of  eigenvector  differentiation, 
with  a  special  focus  upon  loss  of  differentiability  (e.g.,  near  me 
repeated  eigenvalues  and  other  singular  circumstances).  Extensive 
numerical  experience  with,  for  example,  the  formulations  derived 
by  Urn  ct  al.'^  indicate  that  consistently  normalized  eigenvectors 
using  Eqs.  (6)  are  differentiable  except  in  isolated  events.  We  avoid 
.  the  known  degenerate  situations  here,  by  ruling  out  the  obvious  pos- 
sibilities  by  enforcing  definiteness  assumptions  on  the  mass  matrix, 
and  we  do  not  treat  the  case  of  repeated  cigenvducs. 

For  control  design  applications,  matrix  A  is  typically  formed 
from  constant  system  matrices  Af ,  C,  K  and  optiimzation-process- 
variable  gain  matrices,  and  by  taking  matrix  Af  for  L,  matrix  B  is  a 
constant  positive-definite  symmetric  matrix  md  the  cigenvducs  are 
distinct  by  assumption.  For  the  structural  optimization  applications, 
matrices  A  and  B  consist  of  varying  Af ,  C,  and  K  matrices,  which 
are  ^sumed  variable  as  functions  of  the  design  parameters  such  as 
beam  thickness,  actuator  locations,  etc.  For  dealing  with  these  en¬ 
gineering  problems,  matrices  A  and  B  are  assumed  to  be  an^ytic 
functions  of  the  design  parameters,  and  we  made  the  heunstically 
reasonable  assumption,  consistent  with  our  experience,  that  eigen¬ 
vectors  are  differentiable,  but  with  special  care  taken  in  accounting 
for  the  normalization  conditions  in  performing  the  differentiation 
process.  Readers  may  refer  to  Refs.  18-20  for  discussions  related 
to  the  sensitivities  of  perturbation  of  eigenvectors  for  the  general 

eigenvalue  problem.  / 

Differentiating  Eqs.  (5)  and  using  Eqs,  (6)  (utilizing  a  modal 
expansion  approach)  with  respect  to  the  design  variable  p.  we  can 
obtain  the  results'^ 
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1  ,rfdA  ,  9B\.  .,  . 

rdB 
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Note  that  the  above  expressions  are  valid  only  for  the  distinct  eigen¬ 
value  case.  Except  for  isolated  events  such  as  multiple  eigenvalues 
and  associated  root  bifurcations,  we  assume  the  eigenvalues  and 
eigenvectors  to  be  smooth  differentiable  functions  of  the  design  pa¬ 
rameter.  The  case  of  repeated  eigenvalues  is  considered  in  another 
recent  study.*® 

Modal  Derivatives  for  the  Second-Order  Systems 
The  cigenstructure  sensitivity  formulas  introduced  in  the  previous 
section  are  useful  in  control  design  and  structure  optimization  prob¬ 
lems.  Since  derivative-based  iterative  routines  arc  often  engaged 
in  these  applications,  it  is  important  to  calculate  *the  eigenvector 
derivatives  both  accurately  and  efficiently.  In  this  section,  by  uti¬ 
lizing  well-known  properties  for  linear  mechanical  second-order 
systems,  efficient  formulas  for  computing  eigenvector  derivatives 
arc  established. 

The  corresponding  right  and  left  eigenvalue  problems  associ¬ 
ated  with  exponential  solutions  (i.e.,  x  —  ae^)  for  the  mechanical 
second-order  system  [Eq.  (2)]  can  be  written,  respectively,  as 

(a,;  Af  -f-  X,-C  -b  K)cci  =  0 
^  (12) 
(x?M+x,c+/i:)  ft=o 

where  X|,  a/,  and  are  fth  eigenvalues  and  right  and  left  modal 
vectors,  respectively  and  generally  have  complex  values.  The  two 
most  popular  choices  for  L  in  Eq.  (3)  will  be  considered  in  this 
study. 

CasehL^M 

The  eigenvalue  problem  using  the  mass  matrix  for  L  can  be  rewrit¬ 
ten  as 

fMoi  ro  Ml 

^'[o  -cy' 

(13) 

where  4>i  €  and  €  R‘^  are  eigenvectors  normalized  using 
Eq.  (6)  and  can  be  partitioned  as 

-!:n'  - 

By  substituting  Eq.  (14)  into  Eq.  (1 3)  and  comparing  it  with  Eq.  (12), 
a  relationship  between  the  right  eigenvectors  of  the  first-order  sys¬ 
tem  and  the  right  eigenvectors  of  the  second-order  system  can  be 
obtained  as 


Also,  using  Eqs.  (13-15)  in  Eq.  (6)  yields  the  normalization  equa¬ 
tions  (biorthogonality  conditions) 

(l  -H  =  I 

r  .r 

MOCi  == 


Considering  a  positive-definite  symmetric  M  matrix  in  Eq.  (6), 
whenever  a  complex  eigenvalue  pair  has  purely  imaginary  parts 
with  an  absolute  value  of  unity  Q.c.,  Xi  =  ±i),  the  first  equa¬ 
tion  yields  zero,  and  obviously  this  equation  cannot  be  applied  for 
normalization  of  the  corresponding  mode’s  eigenvector.  However, 
in  control  design  or  structure  optinuzation  applicatio  as,  we  rarely 
encounter  this  condition,  since  during  the  optimization  procedure 
our  closed-loop  eigqivalues  are  constrained  to  lie  in  the  stable  region 
due  to  closed-loop  stability  constraints,  and  of  course,  this  singular 
condition  is  easy  to  check.  One  other  condition  exists  where  we 
may  have  a  problem  with  normalizing  the  eigenvector.  Suppose  that 
a,'  =  jc  +  iy,  where  jc  and  y  are  real  vectors,  both  not  zero;  then 
ocj  Moti  =  0  if  both  x^Mx  =  y^My  and  My  =  0.  In  response 
to  questions  raised  during  the  review  process,  we  have  studied  this 
condition  and  have  been  unable  to  formally  rule  it  out.  We  believe 
it  to  be  a  singular  condition  rarely  encountered  but  easily  tested  for. 
Thus,  the  normalization  is  not  universally  valid  because  the  normal¬ 
ization  equation  <f>T  =  1  [Eq.  (6)]  can  fail  under  a  few  known 
circumstances.  From  an  engineering  point  of  view,  it  is  almost  al¬ 
ways  useful  (because  the  singular  situations  arc  rarely  encountered 
and  furthermore  may  be  easily  tested  for). 

The  eigenvalue  derivatives  for  second-order  systems  can  be  ob¬ 
tained  by  using  Eqs.  (8)  and  (13-15): 


^fdA  dB\ 


,  dc  dK 
dp  dp 


-dM 

0 

dB 

dp 

ap  ~ 

0 

dM 

dp  - 

Following  a  modal  expansion  approach,  by  substituting  Eqs.  (13- 
15)  into  Eqs.  (9-1 1),  the  eigenvector  derivatives  for  the  second-order 
systems  can  be  represented  as 


. ^ 

'  '  (19) 


1  dK\ 

1  ^ 

=  +  )^kXi)ocl(M  +  M^)ai 

1  ,,,r/  dM  dC  aK\ 


Only  complex-conjugate  pairs  of  eigenvalues  and  eigenvectors  oc¬ 
cur  for  the  case  of  most  interest  (underdamped  second-order  systems 
without  rigid-body  degrees  of  freedom),  and  the  derivatives  of  the 
corresponding  complex-conjugate  eigenvector  pairs  are  also  obvi¬ 
ously  complex-conjugate  vector  pairs.  By  making  use  of  this  prop¬ 
erty,  the  computation  time  for  calculating  the  eigenvector  derivatives 
244  for  the  complex-conjugate  pairs  can  be  immediately  reduced  by  half. 
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Case  11:  K 

The  eigenvalue  problem  using  the  stiffness  matrix  as  L  can  be 
rewritten  as 

In  this  case,  the  normalization  equations  (biorthogonali^  condi¬ 
tions)  are  obtained  as 

ccJ{K  +  X^M)a>=l 

p](K-XiXjM)ai=Sij 


The  procedure  for  deriving  eigenvalue  and  eigenvector  derivatives 
for  this  case  is  similar  to  the  previous  case,  and  therefore  only  the 
final  results  are  summarized: 


3^ 

dp 


-  =  ^  =  X)  ^ . 


2n 

(24) 


where 


X;  .  ac  ^  dK\ 


>¥•1 


1  ^ 

=  K'^)  +  XMM  ■¥  M’')]®,- 


Note  that  the  eigenvectors  <j>i  and  of  the  first-order  systems  can 
be  simply  represented  in  terms  of  the  eigenvalue  and  eigenvectors 
Xi,  oci,  and  Pi  of  the  second-order  system,  as  seen  in  Eq.  (21),  in 
the  case.  Due  to  this  property,  the  eigenvector  sensitivities  can  be 
represented  in  a  more  compact  form  than  the  former  case  (L  =  Af 
case).  Comparison  of  Eqs.  (20)  and  (25),  especially  expressions 
for  bi7,  leads  to  the  conclusion  that,  if  an  efficient  algorithm  for ' 
solving  eigenvalue  problems  [Eqs.  (22)]  for  the  mechanical  second- 
order  system  is  available,  then  Eq.  (25)  will  be  more  effective,  since 
these  equations  do  not  nc^  full  information  on  the  left  eigenvectors, 
including  It  is  also  possible  to  utilize  this  property  in  Eq.  (14) 
for  the  L  =  A/  case;  however,  this  approach  involves  a  matrix 
inverse,  and  therefore  both  the  numerical  accuracy  and  efficiency 
will  be  degraded,  especially  for  large  mass  matrices. 

ni.  Approximation  Methods  in  Computing 
Modal  Derivatives 
Approximation  Method  for  First-Order  System 

The  formulas  for  eigenvector  derivatives  derived  in  the  previous 
section  requires  knowledge  of  all  2n  eigenvectors.  For  very  large 
structural  dynamic  systems,  it  is  well-known  that  only  a  lowest  fre¬ 
quency  subset  of  Nr  modes  (eigenvalues  and  eigenvectors)  may  be 
computed  accurately,  where  Nr  <K  n,  and  in  most  practical  appli¬ 
cations,  only  lens  of  the  lowest  frequency  modes  participate  signif¬ 
icantly  in  a  typical  dynamic  response  of  the  system.  It  is  natural  to 
conjecture  that  the  contributions  of  very  high  frequency  modes  to  the 
sensitivity  of  the  lower  eigenvectors  may  also  be  neglected  to  some 
degree  of  approximation.  If  we  consider  the  problem  that  deriva¬ 
tives  of  only  Nr  modes  are  really  needed,  then  a  method  all 


eigenvectors  may  lead  to  inefficiency  and  a  practical  difficulty  if  all 
of  the  eigenvectors  cannot  be  accurately  computed.  Fbr  this  case, 
an  approximate  method  for  computing  eigenvector  derivatives  has 
been  reported'^-”  by  utiliring  a  modal  truncation  method,  including 
only  a  subset  of  the  system  modes: 

(26) 


(27) 

dp 


where 


i=i 

J*i 


^  <t>iGi  j.  ,  v-'V','- 

an  =  “^  j 

d3 

bn  =  -V’f  -  V’f  BZi  -  an 

dp 


i’j 


(28) 


The  overbar  denotes  an  approximate  solution,  and  it  is  has  been 
found  that  the  approrimation  is  often  very  accurate  for  large  struc¬ 
tural  systems  where  there  exists  a  large  frequency  gap  between  the 
last  included  mode  {Nr)  and  the  next  higher  frequency  mode.  By 
utilizing  the  biorthogonality  conditions,  we  introduce  a  modifica¬ 
tion  of  the  above  results,  especially  in  the  terms  F;  and  Gf .  Our 
modification  follows. 


Eigenvector  Derivative  Approximation  Method 
for  Second-Order  Systems 

We  know  from  empirical  experience  that  the  above  approximation 
method  is  usually  efficient  for  computing  lower  mode  eigenvector 
derivatives.  In  this  section,  a  more  efficient  method  will  be  derived 
especially  for  second-order  systems  by  using  results  of  the  previous 
sections.  Again,  we  develop  here  approximation  expressions  only 
for  the  special  cases  that  the  mass  matrix  or  the  stiffness  matrix  are 
selected  for  the  L  matrix.  In  the  approximation  methods,  case  II 
(using  the  stiffness  matrix  for  L  matrix)  is  very  efficient;  thanks  to 
the  elegantly  simple  expressions  for  the  left  eigenvector  as  seen  in 
Eq.  (21),  and  therefore,  this  formulation  requires  much  less  arith¬ 
metic,  especially  for  computing  the  left  eigenvector  derivatives. 


Case  !:  L  =  Af 

The  exact  eigenvector  derivatives,  Eq.  (19),  can  be  rewritten  as 


^  =an<l>i+Zi, 
dp 


^  =  bni>i  +  «'/ 
dp 


(29) 


where 


z..  =  ^  a, = 


(30) 


/  =  i 
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The  eigenvalues  are  numbered  according  to  increasing  magnitude, 
and  we  assume  that  only  the  lower  W,  modes’  derivatives  are  re¬ 
quited  for  a  suitably  accurate  approximation.  Since  we  use  the  lower 
frequency  K  eigenvalues  and  eigenvectors,  the  higher  mode  eigen¬ 
vectors  (higher  than  the  lowest  N,  modes)  must  be  approwmated. 
gpp^irating^,  and  W)  in  Eqs.  (30)  into  two  parts,  the  first  term  includes 
the  lower  N,  mode  eigenvectors  that  may  be  computed  accurately 
and  the  second  term  includes  higher  mode  eigenvectors  that  will  be 
approximated;  this  yields 


JNr 

Nr  > 

W.-  =  bjjtpj  +  bijrpj 


Substituting  Eqs.  (20)  into  Eq.  (30)  and  using  the  property  for  the 
class  of  problems  with  a  large  frequency  gap, 

Xj  —  Xi  ^  Xj  for  j  >  Nf 

an  approximation  z/  can  be  written  as 

E 

*■'  j=N,+t 

t*'  ,  ,  (32) 

=  >  - 

J*i 

where 

/  ,3M  ,  3C  dK\ 

Since  is  a  scalar,  the  second  summation  on  the  right-hand 

side  of  Eq.  (32)  can  be  simplified.  To  do  this,  we  consider  the  spectral 
decomposition  of  the  A  matrix  using  Eqs.  (5-7): 

A  =  (34) 


h  ‘•I 


r  0  Ml 

-4-^  -c} 


A  —  diag(Xi) 


Equation  (35)  can  be  rewritten  using  Eqs.  (14)  and  (15)  as 

cej^f  ]  r-AT-'CA/-'  -/f-'l 

We  obtain  the  following  useful  relationship  from  the  above  equation: 

v'±r“'<i4-'^-'i  (38, 

Utilizing  Eq.  (38)  in  Eq.  (32),  we  obtain  the  final  approximation 
form  of  Zi  as 


Now  the  modal  representation  of  the  eigenvector  derivative  can  be 
approximated  as 

=  aii4>i  +Z/  •*'  (40) 


where  the  approximation  formula  for  5//  can  be  obtained  by  substi¬ 
tuting  Eqs.  (18)  and  (21)  into  the  formula  for  an  in  Eq.  (28), 

5,  _  -1  [(,  +  i;)ar^« 

+  aJ[M  (41) 

For  the  class  of  problems  that  we  are  dealing  with,  we  have  found 
the  above  approximate  solution  is  very  efficient  and  is  usually 
sufficiently  accurate  to  be  used  in  a  derivative-based  design  or  opti¬ 
mization  process.  It  is  straightforward  but  tedious  to  validate  these 
equations  using  finite  differences  or  by  retaining  all  of  the  eigen¬ 
vectors  in  the  corresponding  “exact”  formulas  developed  above 
(provided,  of  course,  that  it  is  computationally  feasible  to  solve 
the  full-order  eigenvalue  problem). 

Similarly,  the  derivatives  of  the  left  eigenvectors  can  be  computed 
using  the  following  modal  approximation: 


~  =  Biii>i+Wi 

3p 


f  1  „,t/  3Af  3C  3K\ 

{  1  „,r/  3M  ,  3C  dK\  ,mr3M| 


Note  that  we  have  made  use  of  the  following  useful  relations  for 
deriving  the  above  equations: 


I-  - 

^  V’y 1  -  r  ' 


Cose  ll:  L  =  K 

As  in  the  previous  section,  the  eigenvector  appro.ximalion  fonnula 
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for  this  case  is  much  simpler  than  in  case  I  (L  =  ")•  Sina  the 
derivation  is  quite  similar,  we  only  report  the  final  results  here. 

^  K  («) 


jabic  1  Configuration  parameters  of  flexible  beam 


i-t 


where 


and 

/  jOM  ^C  BK\ 

We  use  the  follovring  useful  relations  for  deriving  the  above 
equations: 


1  rjc-n 

a;/3jJ  L  0  J 


(46) 


Parameter 


IV.  Numerical  Example 

To  demonstrate  the  efficiency  and  accuracy  of  the  several  eigen¬ 
vector  derivative  formulas  developed  in  the  previous  sec^ns.  we 
consider  a  moderately  dimensioned  second-order  system.  The  exact 
[Eqs.  (8-11)1  and  approximate  (Eqs.  (26-28)]  methods  along  with 
the  new  formulafions  developed  for  second-order  systems  are  com¬ 
pared.  Eigenvector  derivatives  in  this  paper  were  computed  on  an 
IBM  PC-486DX  (33  MHz)  using  386  MATLAB®. 

To  provide  a  basis  for  comparison,  we  introduce  an  error  measure 
based  on  the  biorthogonality  conditions  of  Eqs.  (6).  The  partial 
derivatives  of  Eq.  (6)  forthei  =  j  case  with  respect  to  the  parameter 
arc  as  follows: 


(47) 


Value 


Units 


Mass  density 
Young’s  modulus 
Beam  length 
Moment  of  inertia 


0.0271875 

0.1584x10*® 

4.0 

4.7095  X  10-* 


slug^ft 

Ib/tf 

ft 

ft* 


and  of  course,  this  agreement  between  finite  deriraUve  approxi- 
mafion  and  analytical  formulas  is  problem  dependent  The  error 
measure  of  Eqs.  (47)  is  convenient;  it  provides  an  e^y-to-compute 
measure  without  requiting  a  problem-dependent  arftsUc  search  for 
“how  small”  to  make  a  finite  difference  parameter  increment  (5p). 
Generally,  the  error  values  computed  from  Eqs.  (47)  are  complex 
numbers,  and  we  define  an  eigenvector  derivative  error  measure  by 
simply  using  the  absolute  value  of  Eqs.  (47),  i.e.. 


It  is  obvious  that  if  computed  eigenvector  derivatives  are  accurate, 
then  as  a  necessary  condition,  the  above  equation  must « satisfied. 
Therefore,  an  error  measure  can  be  defined  as  a  norm  of  the  differ¬ 
ences  from  zero  when  computed  derivatives  are  substituted  into  the 
above  equations.  Although  this  is  only  a  necessary  condition  test  on 
the  validity  of  the  eigenvectors,  we  have  found  that  it  is  very  useful 
to  identify  poorly  approximated  eigenvector  derivatives  and  can  be 
routinely  computed  more  efficiently  than  foming  a  large  table  of 
finite  difference  approximations  and  comparing  them  to  the  coffe- 
sponding  analytic  derivatives.  We  mention,  however,  that  we  have 
done  extensive  finite  difference  validations  of  all  of  the  above  eigen¬ 
vector  derivative  formulas  with  typical  agreement  being  four  to  nine 
digits,  depending  upon  the  smallness  of  the  finite  difference^ steps, 


|3<^r  ,  ^  ,Tp3^/ 


(48) 


We  mention  the  obvious  fact  that  comparing  the  calailated  eigenvec¬ 
tor  derivatives  with  computed  results  using  finite  difference  approx¬ 
imation  requites  care  on  two  counts.  First,  since  computed  results 
using  a  finite  difference  approximation  are  not  exact  derivatives, 
it  usually  is  necessary  to  explore  the  size  of  the  appropnate  pa¬ 
rameter  increments,  and  if  the  finite  difference  approximation  of 
the  derivative  is  found  to  be  stable  to  at  least  four  sipficant  fig¬ 
ures  (rule  of  thumb)  over  an  order-of-magnitude  variation  m  the 
size  of  the  parameter  increment,  then  the  derivatives  are  iKually 
found  to  be  sufficiently  accurate  for  derivative-based  optimirauon 
processes.  However,  a  patient  pursuit  of  digits  in  Ae  finite  differ¬ 
ence  tuning  can  usually  result  in  much  higher  precision  agreement 
with  the  analytical  partials.  Avoiding  this  finite  difference  artwork 
is  of  course  a  primary  motivation  to  have  analytic^  pamal  denva- 
tives  and  analytical  necessary  condition  tests  such  as  (48)  to 
test  for  arithmetic  errors.  Second,  and  most  importantly.  Ae  nor¬ 
malization  conditions  (in  the  biorthogonality  conditions)  that  were 
enforced  in  deriving  the  eigenvector  derivative  formute  must  be 
enforced  on  the  nominal  and  perturbed  eigenvectors  used  m  the  fi¬ 
nite  difference  computations.  We  have  concluded  that  the  above 
error  norm  represents  an  attractive  necessary  condiUon  measure  tor 
checking  computed  eigenvector  derivatives  and  ram  ma^  ways 

more  attractive  than  comparisons  to  results  using  the  finite  difference 

method.  Therefore,  in  this  study,  the  error  measure  introduced  in 
Eq.  (48)  will  be  used  for  checking  accuracy  of  computed  eigenvector 

sensitivities.  _  . 

Consider  a  transverse  vibration  of  a  uniform  cantilever  beam, 
finite  element  method**  *'*  is  adopted  for  modeling,  and 
damping  (assumed  damping  ratio  of  0.001)  is  included,  „  - 
metric  and  material  parameters  of  the  beam  are  Irat^  m  Tab  el. 
To  demonstrate  the  effectiveness  of  the  new  methods  for  at  least 
moderately  high  dimensioned  problems,  20  elements  are  consid¬ 
ered,  and  therefore,  using  the  usual  cubic  spline  beam  elements  (  he 
system  configuration  coordinates  are  the  deflection  and  slope  at  the 
right  end  of  each  element),  the  dimension  of  the  mass,  dampin^,, 
and  stiffness  matrices  is  40  x  40.  In  order  to  evaluate  the  eigen- 
value/eigenvector  derivatives,  all  elements  of  the  luass,  ^*^P'  ®’ 
and  stiffness  matrices  are  perturbed  about  0.1%  arbitranly  for  t 
special  example,  and  the  errors  of  the  eigenvector  sensitivities  du 
to  the  perturbation  are  given  below.  Note  that  eigenvector  den  - 
lives  are  calculated  for  the  normalized  eigenvectors,  and  for  tni 
special  example  the  norm  of  the  eigenvector  is  of  order  1  for  at 
modes,  and  the  norms  of  the  eigenvector's  derivatives  are  ofordcr  i 
for  the  low-frequency  modes  and  of  order  2  for  the  high-frequency 
modes  Therefore,  it  is  evident  that  a  computed  result  accurate  to 
better  than  seven  digits  in  the  worst  case  was  obtained  using  the 
three  alternative  formulas  developed  above  for  c.xact  eigenvector 
derivatives. 
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Table  2  Errors  of  right  eigenvector  derivatiYes 


Second-order  method 

Mode 

First-order  method 

Method  P 

Method  n** 

1 

0.1655x10-“ 

56456x10-* 

Z6324X10-* 

2 

0.0083x10-“ 

01870x10-* 

0.4660x10-* 

3 

0.0003  X 10-“ 

1.2028x10-* 

0.1214x10-* 

4 

0.0001  X 10-“ 

0.4148  X  10"* 

0.0227  X 10-* 

40 

0.4805  X 10-“ 

1.9319x10-* 

3.6814x10-" 

Times,  s 

879.68 

310.27 

301.04 

Percent 

100 

3527 

34.22 

»UseL  = 

M.  '*U$e  L^=K. 

Table  3  Errors  of  left  eigenvector  derivatives 

Second-order  method 

Mode 

Rrst-order  method 

Method  I* 

Method  11'’ 

1 

1.4541  X  10"’ 

13319x10-* 

1.3845  X 10-* 

2 

0.0946x10-* 

0.1707x10-* 

0.0018  X  10-* 

3 

0.1575  X  10-® 

03062x10-* 

0.0023  X  10-* 

4 

0.1076x10-* 

0.0512  X  10-* 

0.0165  X  10-* 

40 

35224  X  10-* 

3.4451  X  10-* 

3.2120  X  10-* 

Time,  s 

1187.11 

433.48 

393.43 

Percent 

100 

36.52 

33.14 

*UscL  =  Af.  *‘UseL  =  ^:. 


The  error  of  the  right  and  left  eigenvector  derivatives  using  the 
exact  formulas  are  summarized  in  Tables  2  and  3,  respectively,  and 
the  error  measures  of  the  first  four  lower  modes  and  the  highest 
(40th)  mode  are  reported.  For  the  first-order  method,  we  use  the  mass 
matrix  Af  for  the  L  matrix  and  apply  the  exact  formula  Eqs.  (8-1 1) 
with  Eqs.  (3)  and  (4).  Note  that  for  computing  the  left  eigenvector 
derivatives,  we  use  partial  computations  (a/,)  from  the  calculation 
of  right  eigenvector  derivatives,  and  therefore  the  errors  (of  the  right 
eigenvector  derivatives)  propagate  into  the  computation  of  the  left 
eigenvector  derivatives.  It  is  evident  that  some  of  the  information 
needed  on  the  left  eigenvector  derivative  is  already  known  from  the 
right,  and  this  valuable  information  can  be  utilized  for  computing 
left  eigenvector  sensitivities.  However,  left  eigenvector  sensitivities 
cannot  be  computed  without  former  computations  of  a,-,-.  Therefore 
computing  time  for  left  eigenvector  sensitivities  includes  calculating 
all  Oij  coefficients  [for  computing  an,  we  need  aij(i  #  j)],  and 
naturally  more  computing  time  is  needed  for  computing  the  left 
eigenvector  derivatives. 

There  are  several  formulas  for  computing  the  eigenvector  sensi¬ 
tivities  discussed  in  this  paper.  We  will  refer  the  exact  and  approx¬ 
imate  methods  to  the  existing  exact  and  approximate  formulas  for 
computing  eigenvector  sensitivities,  respectively.  For  the  presented 
methods  for  the  second-order  systems,  whether  the  exact  formula 
or  the  approximate  formula  is  used,  method  I  refers  to  the  case  that 
mass  matrix  M  is  used  for  matrix  L,  and  method  II  refers  to  the  case 
that  stiffness  matrix  M  is  used  for  matrix  L,  As  shown  inTable  2,  the 
accuracy  of  method  I  is  lower  than  that  of  the  first-order  method,  but 
both  are  acceptable.  The  errors  of  right  eigenvector  sensitivities  us¬ 
ing  method  II  are  not  uniform  and  are  a  little  larger  than  for  method  I. 
The  computation  time  for  the  second-order  method  is  three  times 
less  than  the  computation  time  required  for  the  exact  formula  for  the 
first-order  system  (Tables  2  and  3).  In  this  study,  in  order  to  com¬ 
pute  eigenvalues  and  eigenvectors  for  this  second-order  system,  we 
use  an  eigenproblem  solver  for  the  first-order  system,  and  the  B  ma¬ 
trix  in  Eq.  (4)  is  moderately  ill-conditioned  (the  condition  number 
is  0(1  o’)].  For  method  II,  the  poor  conditioning  of  the  B  matrix 
results  from  the  fact  that  not  only  the  dimension  of  B  matrix  is  large 
(i.e.,  80).  but  also  the  order  of  magnitude  of  mass  matrix  elements 
is  significantly  different  from  that  of  stiffness  matrix  elements.  For 
this  system,  the  computed  eigenvectors  also  include  errors;  this  is 
evident  by  nonzero  residuals  if  one  substitutes  the  computed  eigen¬ 
vectors  into  the  biorlhogonality  conditions.  Especially  for  large 
systems,  errors  may  be  [)ropagaicd  from  incorrect  eigenvector  com¬ 
pulations  into  the  analytically  derived  formulas  for  the  eigenvector 


Table  4  Errors  of  right  eigenvector  derivatives: 
approrimation  methods 


Second-order  method 

Mode 

First-order  method 

Method  P 

Method  n'’ 

1 

0.0831  X  10-“ 

53276x10-* 

1.3592x10-" 

2 

0.0018  X  10-“ 

0.4207x10-* 

0.0122x10-" 

3 

0.0017  X  10-“ 

0.6170  X  10"* 

0.0022x10-" 

4 

0.0011  X  10-“ 

0.1600x10-* 

0.0010x10-" 

5 

0.0002x10-“ 

0.0402  X  10-* 

0.0004x10"“ 

Time,  s 

14.61 

2.69 

2.80 

Percent 

100 

18.41 

19.16 

*u«t  = 

Af,  *‘Usc£.  =  a:. 

Table  5  Errors  of  left  eigenvector  derivatives: 

approximation  methods 

Second-order  method 

Mode 

First-order  method 

Method  P 

Method  II" 

1 

0.0007  X  l0-“ 

13653x10-* 

3.0959x10"“ 

2 

0.0111x10"“ 

0.0585  X  10"* 

0.0160x10"“ 

3 

0.0014x10"“ 

0.1310  X  10"* 

0.0015  X  10"“ 

4 

0.0004x10"“ 

0.0440x10-* 

0.0004  X  10"“ 

5 

0.0002x10"“ 

0.0123  X  10"* 

0.0001  X  10"“ 

Time,  s 

27.13 

18.07 

8.84 

Percent 

100 

66.61 

325Z 

»Usc  L  =  Af.  *»Use  L  =  K. 


derivatives.  Thus  the  validity  of  the  derivative  approximation  rests 
not  only  upon,  for  example,  including  all  of  the  important  modes  in 
a  modal  truncation,  but  also  upon  the  manner  in  which  arithmetic 
errors  in  the  original  eigensolution  propagate  through  the  particu¬ 
lar  derivative  equation  calculations.  From  these  observations,  and 
other  empirical  experience,  we  recommend  that  method  IT  should 
be  used  only  for  relatively  low  dimensioned  systems,  and  method  I 
is  recommended  for  high-dimensioned  applications. 

For  the  eigenvector  derivative  approximation  methods,  only  the 
first  five  (lowest  frequency)  modes  (Nr  —  10)  are  computed. 
Tables  4  and  5  summarize  the  results  using  our  (improved)  approx¬ 
imation  methods.  The  errors  of  methods  I  and  II  are  larger  than 
those  for  (improved)  approximation  method  for  the  first-order  sys¬ 
tem  but  are  judged  acceptable  for  most  applications.  As  shown  in 
Tables  4  and  5,  when  we  use  approximation  method  I,  the  compu¬ 
tation  time  for  computing  the  right  eigenvector  derivatives  is  five 
times  faster  than  the  approximation  method  for  the  first-order  sys¬ 
tem,  and  for  computing  the  left  eigenvector  derivatives,  it  is  approx¬ 
imately  twice  as  fast.  Approximation  method  II  is  found  to  be  much 
faster  than  method  I,  and  the  computation  errors  are  also  smaller. 
Another  interesting  phenomenon  is  that  the  results  using  the  ap¬ 
proximation  methods  (Tables  4  and  5)  for  the  lower  modes  turn 
out  to  be  more  accurate  than  those  of  the  exact  (in  theory)  meth¬ 
ods  (Tables  2  and  3).  We  may  explain  this  phenomenon  by  noting 
that  numerically  inexact  computed  eigenvectors  associated  with  the 
higher  frequencies  are  included  in  evaluating  the  exact  formulas, 
but  not  in  the  approximate  solution,  and  another  contributing  factor 
is  that  the  approximate  method  is  much  less  intense  computation¬ 
ally,  and  therefore  the  approximate  formulas  are  less  susceptible 
to  the  accumulation  of  arithmetic  errors.  These  results  provide  a 
basis  for  optimism  as  regards  the  practical  utility  of  the  new  ap¬ 
proximate  eigenvector  derivative  formulas  presented  herein,  but  as 
with  any  modal  truncation  method,  the  issue  of  which  modes  to 
retain  is  problem  dependent  and  generally  impossible  to  resolve 
universally. 

V.  Conclusions 

This  paper  derives  some  new  exact  and  approximate  formulas 
for  computing  eigenvector  derivatives  for  second-order  mechanical 
systems.  In  order  to  demonstrate  the  cffcciivcticss  and  accuracies 
of  the  new  formulas,  a  numerical  study  using  a  moderately  liigh 
dimensioned  flexible  structure  is  presented.  Tlie  usefulness  of  the 
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new  methods  has  been  verified  by  comparing  computation  time  to 
the  corresponding  computation  time  for  the  exact  formulas  for  the 
first-order  system,  and  the  accuracy  of  the  new  methods  has  also 
been  found  to  be  excellent  in  the  current  example.  These  formu¬ 
lations  are  suitable  for  incorporation  into  iterative  computer-mded 
design  optimization  algorithms  and  should  find  wide  application. 

Acknowledgments 

This  work  was  partially  supported  by  the  U,S,  Air  Force  Office 
of  Scientific  Research  under  Contract  F49620-92-J0496;  technical 
discussions  with  S,  Wu  and  J.  Chang  are  warmly  acknowledged 
We  are  also  pleased  to  acknowledge  the  lustorical  motivations  of 
the  work  by  B.  Wang  and  K.  Lim, 

References 

^Collins,  J.  D.,  Hart.  G.  C„  Hassciman,  T.  IC,  and  Kennedy,  B.,  “Sta¬ 
tistical  Identification  of  Structures^  AIAA  Journal,  Vol.  12.  Feb.  1972,  pp. 
jg5— 190 

^Bemian.  A.,  and  HanneUy.  W.  “Theory  of  Incomplete  Models  of 
Dynamic  Structures.”  AMA  Journal,  Vol.  9,  No.  8. 1971,  pp.  1481-1487. 

^Haftka,  R.  T,  Maitinovic,  Z.  N.,  Hallauer,  W.  L.,  and  Schamel.  G.,  “Sen¬ 
sitivity  of  Optimized  Control  Systems  to  Minor  Structural  Modifications.” 
AIAA  Paper  85-0807,  April  1985. 

^Lim,  K.  B..  and  Junldns,  J.  L.,  “Optimal  Redesign  of  Dynamic  Structures 
via  Sequential  Linear  Programming.”  Proceedings  of  the  Fourth  Interna- 
tional  Modal  Analysis  Conference  {Los  Ang,cies,CA),  1986,  pp.  1615-1620. 

^Chou,  Y.-F.,  and  Chen,  J.-S.,  “Structural  Dynamics  Modification  via 
Sensitivity  Analysis,”  Proceedings  of  the  Third  International  Modal  Analysis 
Conference  (Orlando,  FL).  Vol.  1, 1985.  pp.  483-489. 

^Jin,  I.  M..  and  Schmit.  L.  A„  “Control  Design  Variable  Linking  for 
Optimization  of  StructuraVControl  Systems”  AIAA  Paper  91-1157,  April 

1991.  •  .  ,  . . 

‘^Bodden.  D.  S.,  and  Junkins.  J.  L.,  “Eigenvalue  Optimization  Algonlhms 
for  Stnictural/Controller  Design  Iterations.”  Journal  of  Guidance,  Control, 
and  Dynamics,  Vol.  8,  No.  6, 1985,  pp.  697—706. 

^Junkins.  J.  L.,  Bodden,  D.  S..  and  Turner.  J.  D..  “A  Unified  Approach 
to  Structure  and  Control  System  Design  Iterations.”  pr^ented  at  the  Fourth 
International  Conference  on  Applied  Numerical  Modeling,  Tainan,  Taiwan, 
ROC.  Dec.  1984. 


’Lim,  K.  B.,  and  Junkins.  J.  L.,  “Optimal  Design  of  Dynamic  Structures 
via  Sequential  Uncar  Programming."  presented  at  the  Fburth  International 
Modal  Analysis  Conference,  Los  Angeles,  CA,  Feb.  1986. 

Wjunkins.  J.  L..  and  lOm,  Y..  “Fust  and  Second  Order  Sensitivity  of  the 
Singular  Value  Decorapositioa,"  Journal  oftheAstronaudeal  Sciences,  Vol, 

38.  No.  1, 1990,  pp.  69-86,  w  a. 

“Junldns.  J,  L.,  and  Kim.  Y„  “A  Minimura  ScnsiUvity  Design  Method 
for  Output  ftedback  Controllers,”  Mechanics  and  Control  of  Large  Space 
Srrucfuw.Progiessin  Astronautics  and  Aeronautics  Scries.  Vol,  129,  AIAA, 
Washmgton,DC  1990.  Chap.  15. 

“Sobel,  K.  M.,  Yu,  W„  and  Lallman,  F,  J.,  “Eigenstnicture  Assignment 
with  Gaun  Suppression  Using  Eigenvalue  and  Eigenvector  Derivatives.” 
Journal  of  Guidance,  Control,  ondDynamics,  Vol.  13,  No.  6. 1990,  pp.  1008- 

1013.  ^ 

“Nelson.  R.  B.,  “Sin^lified  Calculation  of  Eigenvector  Denvatives, 

AIAA  Journal,  Vol.  8.  No.  9. 1976.  pp.  1201-1205. 

“Fox.  R.  L.,  and  Kapoor.  M.  P..  “Rates  of  Change  of  Eigenvalues  and 
Eigenvectors,”  AIAA  Journal,  Vol.  6,  No.  12, 1968,  pp,  2426—2429. 

“Urn,  K.  B.,  Junkins,  J,  L..  and  Wang,  B.  P.,  “Re-examinadon  of  Eigen¬ 
vector  Derivatives.”  Journal  of  Guidance,  Control,  and  Dynamics,  Vol.  10. 
No.  6, 1987.  pp.  58 1-587. 

“Dailey,  R.  L.,  “Eigenvector  Derivatives  with  Repeated  Eigenvalues, 
AIAA  Journal,  Vol.  27,  No.  4, 1989,  pp.  486-49 1 . 

“Wang,  B,  P..  “Improved  Approximate  Methods  for  Computing  Eigen¬ 
vector  Derivatives  in  Structural  Dynamics,”  AIAA  Journal,  Vol.  29.  No.  6, 

1991,  pp.  1018-1020.  .  T  u 

“Golub.  G.  H..  and  Van  Loan,  C.  F.,  Matrix  Computations,  2nd  ed.,  Johns 
Hopkins  Univ.  Press,  Baltimore,  MD,  1989,  Sec,  7.2. 

“Wilkinson,  J.  H.,  The  Algebraic  Eigenvalue  Problem,  Oxford  Univ. 
Press,  New  York,  1965,  Chap.  2. 

2«Stewart,  G.  W.,  Introduction  to  Matrix  Computations,  Academic.  New 

York,  1965,  Sec.  6.4.  . 

Kato,  T.,  Perturbation  Theory  for  Linear  Operators,  2nd  ed..  Spnnger- 

Verlag,  New  York,  1976. 

^Sun,  J.,  “Eigenvalues  and  Eigenvectors  of  a  Matrix  Dependent  on  Sev- 
eral  Param^ers,”  Journal  of  Computational  Mathematics,  Vol.  3,  No.  4. 
1985.  pp.  351-364. 

“Craig,  R..  Jr.,  Structural  Dynamics:  An  Introduction  to  Computer  Meth- 
o<is.  Wiley,  New  Yoilc,  1991. 

“Meirovitch,  L.,  Computational  Methods  in  Structural  Dynamics, 
Sijthoff  and  Noordhoff,  The  Netherlands,  1980. 


Optimal  Control  of  Second 
Order  Dynamical  Systems 


John  L.  Junkins  and  John  E.  Hurtado 
Texas  A&M  University 


AIAA  Structures,  Structural  Dynamics,  and 
Materials  Conference 

April  1995 
New  Orleans,  LA 


250 


OPTIMAL  CONTROL  OF 
NATURAL  SECOND  ORDER  SYSTEMS 


Johnny  E.  Hurtado*  and  John  L.  Junkinst 
Texas  A&M  University,  College  Station,  TX  178^3 


Abstract 

This  modest  note  presents  the  necessary  condi¬ 
tions  related  to  the  optimal  control  of  natural  sec¬ 
ond  order  systems.  The  development  includes  sys¬ 
tems  subject  to  holonomic  constraints.  For  natu¬ 
ral  systems,  the  second  order  form  of  the  governing 
differential  equations  are  augemented  to  the  perfor¬ 
mance  index,  and  as  a  consequence,  the  resulting 
adjoint  system  defining  the  necessary  conditions  of 
optimality  is  also  second  order  in  form.  For  natural 
systems  subject  to  holonomic  constraints,  the  sec¬ 
ond  order  differential  equations  of  motion  and  the 
algebraic  equations  of  constraint  are  augemented  to 
the  performance  index.  Following  the  usual  meth¬ 
ods,  we  find  that,  like  the  original  dynamical  system, 
the  resulting  adjoint  system  is  also  holonomically 
constrained.  We  propose  an  augmented-Lagrangian 
method  to  numerically  solve  the  coupled  set  of 
differential-algebraic  equations  within  the  solution 
of  the  two-point  boundary  value  problem. 

Introduction 

A  significant  class  of  problems  in  analytical  me¬ 
chanics  fall  under  the  heading  of  natural  systems. 
These  include  robotic  and  satellite  systems  wherein 
the  joint  angles  between  substructures  may  undergo 
large  rotations.  Many  times,  the  governing  differen¬ 
tial  equations  of  these  systems  are  subject  to  holo¬ 
nomic  constraints.  That  is,  the  equations  of  mo¬ 
tion  are  formulated  such  that  the  generalized  co¬ 
ordinates  are  not  independent,  but  rather  they  are 
related  thru  algebraic  equations. 

Traditionally,  vis-a-vis  optimal  control  formula¬ 
tions,  natural  systems  are  treated  no  differently:  the 
equations  of  motion  are  cast  into  first  order  form, 
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and  following  the  usual  variational  calculus  tech¬ 
niques,  one  arrives  at  the  adjoint  system  of  first  or¬ 
der  differential  equations  which  must  be  satisfied  to 
meet  the  necessary  conditions  of  optimality.^ 

When  the  dynamical  system  is  subject  to  holo¬ 
nomic  constraints,  the  optimal  control  formulation 
often  begins  with  manipulating  the  governing  equa¬ 
tions  by  one  of  three  methods  before  the  usual  pro¬ 
cedures  for  arriving  at  the  necessary  conditions  for 
optimal  control  are  applied.  In  the  first  method,  the 
holonomic  constraints  are  used  to  eliminate  redun¬ 
dant  coordinates  algebraically  and  the  equations  of 
motion  Bie  formulated  using  a  minimal  coordinate 
description  of  the  system.  Because  these  formula¬ 
tions  rely  upon  a  minimal  set  of  coordinates,  the 
resulting  system  is  no  longer  explicitly  constrained. 
In  a  second  method,  locally  equivalent  to  the  first, 
the  generalized  coordinates  undergo  a  judicious  non¬ 
linear  coordinate  transformation.  In  these  new  co¬ 
ordinates,  the  constreiints  are  trivially  satisfied  leav¬ 
ing  a  subset  of  differential  equations  which  are  not 
subjected  to  constraint  forces.^  A  third  approach 
begins  by  differentiating  the  holonomic  constraint 
equations;  the  result  is  arranged  as  a  linear  opera¬ 
tion  on  the  generalized  coordinate  acceleration  vec¬ 
tor.  This  allows  the  elimination  of  the  Lagrange 
multipliers  appearing  in  the  differential  equations  of 
motion  in  favor  of  nonlinear  functions  of  the  gener¬ 
alized  coordinates,  velocities  and  controls.  This  ap¬ 
proach  is  known  as  either  a  range  space  or  null  space 
formulation  depending  on  the  particular  method  of 
elimination  used.® 

All  three  of  the  above  methods  result  in  a  “con¬ 
straint  free”  form  of  the  system  differential  equa¬ 
tions  of  motion  wherein  the  generalized  coordinates 
may  be  considered  independent.  As  mentioned,  sub¬ 
sequent  to  these  manipulations,  the  usual  proce¬ 
dures  for  deriving  expressions  for  the  optimal  con¬ 
trol  may  be  applied.  For  all  but  trivial  examples, 
however,  these  methods  lead  to  almost  intractable 
governing  equations. 

Below,  we  formulate  the  optimal  control  prob¬ 
lem  for  natural  systems  in  second  order  form.  As 
a  consequence,  because  the  resulting  coupled  differ¬ 
ential  equations  are  in  second  order  form,  in  solving 


them  we  may  use  one  of  the  many  implicit  integra¬ 
tion  schemes  available.®*^  These  schemes  were  espe¬ 
cially  designed  for  mechanical  systems.  For  systems 
subject  to  holonomic  constraints,  we  pursue  a  differ¬ 
ent  avenue  towards  the  optimal  control  than  those 
methods  outlined  above.  Our  approach  is  driven  by 
the  desire  to  avoid  nonlinear  transformations  of  the 
generalized  coordinates  or  the  elimination  of  the  La¬ 
grange  multipliers  from  the  differential  equations  of 
motion. 


Natural  systems  are  identified  as  those  for  which 
the  kinetic  energy  is  expressed  as  a  quadratic  func¬ 
tion  of  the  generalized  velocities.  Specifically, 

T  = 

Here,  rriij  is  the  symmetric,  positive  definite  mass 
matrix  and  is  seen  to  be  a  function  of  the  generalized 
coordinates  q — we  adopt  the  convention  that  repeti¬ 
tion  of  an  index  in  a  term  will  denote  a  summation 
with  respect  to  that  index  over  its  range.  Using 
Lagrangian  mechanics  to  develop  the  equations  of 
motion  begins  with  forming  the  system  Lagrangian 
as  the  difference  between  the  kinetic  and  potential 
energies, 

where  the  potential  energy  V  is  generally  a  nonlin¬ 
ear  function  of  the  generalized  coordinates.  Upon 
identifying  any  generalized  forces  which  do  noncon¬ 
servative  work,  the  form  of  Lagrange’s  equations  be¬ 
come 


The  Qk  are  nonconservative  generailized  forces  act¬ 
ing  on  the  system  and  they  are  often  generated  by 
a  linear  operation  on  a  vector  of  control  inputs  via 
Qk  =  Bkm  The  matrix  Bkm  is  often  called  the 
control  influence  matrix. 

Performing  the  implied  differentiation  above,  the 
differential  equations  of  motion  are 

dV 

^7  +  qi  qj  +  =  Bkm  (2) 

where  the  third  order  tensor  Hkij  is  commonly 
referred  to  as  the  Christoff  el  operator  of  the  first 
kind  and  is  defined  as 

jj  def  1  /  dmki  drukj  dmij . 

“  2  ^  dqj  dqi  dqk 


It  is  convenient  to  denote  rhij{q)  as  elements  of 
the  inverse  of  the  mass  matrix  (i.e.  rhik  mkj  = 
Sij)j  which  allows  us  to  write  the  governing  set  of 
equations  as 

+  ht(5,  $)  +  ^i(?)  =  (3) 

where 

hi{q,q)  =  mk{q)Tlkij{q)qiqj, 

9i{q)  '*=  ’^*=(9)  and 

6.m(9)  mk{q)Bkm- 

As  mentioned  earlier,  in  many  system  represen¬ 
tations  the  generalized  coordinates  q  are  not  inde¬ 
pendent,  but  rather  they  are  related  thru  a  set  of 
nonlinear  holonomic  constraint  equations  given  by 

(Po{q)  =  0. 

Now,  because  the  coordinates  are  not  independent, 
one  must  account  for  the  constrednt  forces  which 
restrict  the  time/space  evolution  of  the  system. 
This  is  done  by  representing  the  constraint  forces 

d<p0  . 
dqk 

where  Ao  are  elements  of  a  time  varying  vector 
of  Lagrange  multipliers  which,  when  determined 
correctly,  enforce  the  holonomic  constraints  of  the 
system.  Physically,  the  normal  component  of  the 
constraint  force  is  proportional  to  the  gradient  of 
the  constraint  function.  These  constraint  forces  are 
added  to  the  right-hand  side  of  eq.(l),  and  so,  in  the 
present  context  the  constrained  dynamical  system  is 
described  by  the  set  of  equations 


subject  to  (po{<l)  =  0*  (^) 

or,  performing  the  implied  differentiation, 

qi  +  hi{q,q)-{- gi{q)  = 

bim{q)um‘^dio{q)Xo  (6) 

subject  to  fPo{q)  =  (^) 

where 

,  /  .  d«f  -  /X  d(po 

dioiq)  =  rriikiq) 

We  emphasize  that  this  set  of  differential-algebraic 
equations  given  by  eqs.(4)  and  (5)  must  be  solved 
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simultaneously  for  the  unknown  vectors  q{t)  and 

X{t). 

We  next  pose  the  optimal  control  statement  for 
the  natural  systems  described  above. 


Necessary  Conditions  for  Optimal  Control 

The  necesseiry  conditions  for  optimal  control  are 
almost  universally  derived  with  the  equations  of  mo¬ 
tion  in  first  order  form.  Below,  we  use  the  tech¬ 
niques  of  variational  calculus  to  obtain  the  necessary 
conditions  for  the  natural  second  order  systems  in¬ 
troduced  in  the  previous  section.  We  begin  with  the 
system  governed  by  eq.(3)  and  then  focus  on  the 
holonomically  constrained  system  given  by  eqs.(6) 
and  (7). 

The  problem  statement  is  the  minimization  of  a 
given  performance  index  subject  to  the  dynamical 
equation  constraints.  We  consider  a  performance  in¬ 
dex  which  contains  terms  that  are  quadratic  in  the 
generalized  positions,  generalized  velocities,  con¬ 
trols  and  control  rates:  including  the  control  rate 
term  allows  one  to  specify  the  value  of  control  at 
the  beginning  and  end  of  the  manuever.  Appending 
the  dynamical  equations  to  the  performance  index 
results  in 


order  state  equations  must  be  satisfied.  Next,  be¬ 
cause  the  variations  of  qj  are  independent  and  arbi¬ 
trary  throughout  the  integration  interval  while  their 
respective  multipliers  are  continuous,  these  multi¬ 
pliers  must  be  indenticedly  zero.®  Similar  reasoning 
applies  in  regarding  the  variations  of  Utn*  These  ar¬ 
guments  provide  us  with  the  second  order  costate 
(or  adjoint)  differential  equations,  and  a  differential 
optimality  condition. 

Original  system: 

qi  +  hi{q,q)+gi{q)  =  bii{q)ui.  (9) 


Adjoint  system: 


d ,  dhi  t  ,  ^9i 
dqj 

Optimality  condition: 


(10) 


Plm  Ul  H-  Rim,  Ul  =  Vi  br 


(11) 


AU  that  is  remaining  is  the  satisfaction  of  the 
boundary  terms  (transversality  conditions)  which 


f  [^QljQi9j  +  ^Qij9iqj 

require 

dh 

tf 

Jto 

^Rtm  Ul  Um  +  |P<»n  Ul  Um  +  Vi  (  -9i 

[C?i;9i  +  Vi 

-Vi^]Sqj 

dqj 

=  0; 

0 

Vi  Sqi 

=  0 


—  hi{qi  g)  —  giio)  +  &im(9) 

where  Vi  is  a  time-var3dng  vector  of  Lagrange  mul¬ 
tipliers,  while  Qfj-,  QJj,  Rim  and  P/m  represent  el¬ 
ements  of  the  weight  matrices  which  axe  defined  in 
the  usual  way.  Limiting  ourselves  to  smooth,  un¬ 
bounded  controk  while  taking  the  first  variation 
yields 


and  Pim  Sum 


=  0. 


(12a -c) 


In  considering  natural  systems  subject  to  holo- 
nomic  constraints,  we  closely  follow  the  develop¬ 
ments  above.  We  begin  by  appending  eqs.(6)  and 
(7)  to  the  performance  index  which  results  in 


SJ  =  [  <? -j-  qi  +  ]  Sqj 


-  Vi  Sqi 


1*/  ft/ 

+  PlmVlSUm\  +  /  [  (  qi  “  Qij  9» 

lo  Jtt 

d  ,  dhi .  dhi  dgi 

Um  )  bqj  +  (  — Pjm  Vl  +  Rim 


+  Vi 


dt 
dbim 
dqj 


+  Vi  bim  )  Sum  +  (  -?<  -  hi{q,  q)  -  gi(q) 
-hbim(q)um)bvi]dt  =  0,  (8) 


where  we  have  performed  an  integration  by  parts  to 
eliminate  Sqjf  Sqij  and  Surn  from  the  integrand.  In¬ 
vestigating  eq.(8),  we  first  comment  that  the  second 


f  \hQij9i9}  +  2Qh9i9j 

Jto 

"1“  "^Rlm  V^m  “1“  2 Vmi 

+  Vi  {-qi  -  hi{q,  q)  -  gi{q)  +  6<m(9)  Um 
+  dio{q)  Ao)  +  7o  ¥>o(g)  ]  dt. 

Here  Vi  and  7o  are  time-varying  vectors  of  Lagrange 
multipliers.  Taking  the  first  variation  of  this  equa¬ 
tion  while  performing  an  integration  by  parts  to 
eliminate  Sqj,  Squ  and  6um  &om  the  integrand  leads 
to 


dh' 

«/ 

0 


Vi  6qi 


to 


+  P/m  Ul  SUm 


Jto 

d ,  dhi .  dhi  dgi 

+  jTV"*  a - ^ 

dt  ^Qj 

+  Vi  -T —  Urn  +  Vi  -5 - Ao  +  7o  )  oqj 

aqj  oqj  oqj 

+  Vidio(g)  6\o]dt 


+  /  [{Plm^l  + RlmUl  +  Vibim)SUmdt. 

Jto 

Note  that  in  the  above  statement  we  have  al¬ 
ready  imposed  the  requirement  that  the  differential- 
algebraic  equations  which  govern  the  original  dy¬ 
namical  system  must  be  satisfied  throughout  the  in¬ 
tegration  interval.  Now,  arguments  similar  to  those 
mentioned  in  the  previous  discussion  lead  us  to  a 
set  of  second  order  costate  (adjoint)  differential- 
algebraic  equations  and  a  differential  optimality 
condition. 


Original  system: 


qi  +  hi{q,q)  +  9i{q)  = 

bim{q)um  +  <iio{q)  Ao 

subject  to  <Po{q)  =  0- 

Adjoint  system: 

d.  dhi,  ,  dhi  dgi 

dhim  ddio  ^  ^ 

Um  -  — - Ao  ) 


dqj 


dqj 


=  Q^jqi-Qijqi  +  ^nfo 

subject  to  Vi  dio{q)  =  0. 
Optimality  condition: 


(13a) 

(13b) 


(14a) 

(14b) 


Pi  m  u;  +  i2/  mV/  —  Vi  bi-i 


(15) 


The  corresponding  boundary  terms  are  identical 
to  those  given  earlier  except  that  now,  like  the 
differential  equations,  these  boundary  conditions 
must  be  satisfied  subject  to  eqs.(13b)  and  (14b). 


Numerical  Solution  of  the  TPBVP 

The  set  of  equations  defining  the  necessary  con¬ 
ditions  for  optimal  control  represent  a  two-point 
boundary  value  problem.  In  most  nonlinear  prob¬ 
lems  of  practical  interest,  this  system  of  equations 


must  be  solved  numerically.  While  there  are  many 
different  numerical  methods  which  may  be  applied^ 
(the  method  of  particular  solutions,  polynomial  ap¬ 
proximation  methods,  quasi-linearization  methods, 
etc.),  we  use  the  shooting  method  in  the  examples 
that  follow.  But  rather  than  focus  on  the  numeri¬ 
cal  technique  used  to  attack  the  two-point  bound¬ 
ary  value  problem,  we  look  to  the  necessary  condi¬ 
tions  in  their  second  order  form  to  see  if  any  advan¬ 
tages  are  offered  within  the  solution  of  the  two-point 
boundary  value  problem. 

Beginning  with  a  natural  system  whose  motion 
is  governed  by  eq.(3),  we  recall  that  the  necessary 
conditions  for  optimal  control  are  given  by  eqs.(9) 
thru  (11)  and  the  boundary  conditions  eq.(12).  One 
possible  advantage  to  the  second  order  develop¬ 
ment  may  be  that  because  the  differential  equations 
are  in  second  order  form,  one  may  take  advantage 
of  some  peirticular  implicit  integration  schemes.®’^ 
These  schemes  were  especially  designed  with  natu¬ 
ral  systems  in  mind. 

Concerning  a  natural  system  subject  to  holo- 
nomic  constraints,  we  recall  that  the  necessary  con¬ 
ditions  for  optimal  control  are  given  by  eqs.(13) 
thru  (15)  and  the  boundary  conditions — recall  that 
the  equations  came  about  by  electing  not  to  per¬ 
form  a  nonlinear  transformation  of  the  generalized 
coordinates  or  eliminate  the  Lagrange  multipliers 
which  enforce  the  constraint  forces.  These  equa¬ 
tions  are  indentified  as  differential-algebraic  equa¬ 
tions  and  their  solution  requires  careful  attention. 
While  numerical  solutions  strategies  for  differential- 
zilgebraic  equations  have  been  the  focus  of  research 
for  some  years,  a  penalty  solution  method  has  re¬ 
cently  shown  considerable  promise. 

Historically,  the  primary  use  of  augmented  La- 
grangian  methods  has  been  in  obtaining  solutions 
to  time  independent  problems  that  are  subject 
to  constraints.  Recently  however,  these  meth¬ 
ods  have  been  extended  to  address  the  differential- 
algebraic  equations  which  arise  in  multi-body  dy¬ 
namic  formulations.^^*^^  Moreover,  analysis  for  very 
general  nonlinear  dynamical  systems  has  been  con¬ 
ducted  which  not  only  proves  convergence,  but  es¬ 
tablishes  bounds  on  the  rate  of  convergence  of  the 
method.^^ 

The  general  strategy  of  augmented  Lagrangian 
methods  is  iterative  and  involves  approximating  the 
constraint  forces  and  the  Lagrange  multipliers  which 
enforce  them.  The  approximate  multipliers  are  up¬ 
dated  based  upon  a  measure  of  constraint  violation. 
When  applied  to  constrained  dynamical  systems, 
the  solution  process  can  be  viewed  os  quasi-static 
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in  nature.  Specifically,  an  iteration  process  is  trig¬ 
gered  at  each  time  step  wherein  the  postions  and 
velocities  are  treated  as  constant  while  the  acceler¬ 
ations  are  considered  a  static  quantity.  As  appUed 
to  our  coupled  state/ad^oint  differential-algebraic 
equations  our  strategy  involves  investigating  the  dy¬ 
namics  of  the  state  and  adjoint  systems  separately: 
the  key  Ues  in  looking  at  the  dynamics  of  the  orig¬ 
inal  system  first.  The  iteration  process  is  outlined 
below  and  closely  parallels  that  given  in  Ref.  (13). 

Before  we  continue,  we  remark  that  with  smt- 
able  defintions,  we  may  express  the  state/adjoint 
differential-algebraic  equation  as 

Qi  =  m  (9i  “)  + 

subject  to  ^0(9)  — 


.  d(po 

Vi  =  pi{q,q,v,v,u, 

subject  to  Vj  djo{q)  =  0* 


Now  then,  the  iterative  scheme  triggered  at  each 
time  step  is  based  upon  the  following  approximation 
to  the  original  system: 


9/*  =  Vi  (9i  9i  ®)  +  *^io 

^  dt^dqi 
+  2Cuipo  +  vj'^<Po], 


(16) 


AS+"  =  AS 

-it 


-t- 

with  AS  =  O' 


(17) 


In  the  above,  9”  and  AS  represent  current  approxi¬ 
mations  to  the  true  accelerations  and  Lagrange  mul- 
tipHers  respectively,  while  the  bracketed  term  rep¬ 
resents  a  measure  of  constraint  violation.  Further, 
n  is  the  iteration  number,  c>  0  is  a  small  penalty 
factor,  and  >  0  represent  a  damping  factor  and 
frequency  associated  with  the  constraint  violation. 

The  iterative  procedure  at  time  t  begins  by  solv¬ 
ing  eq.(16)  for  the  approximate  acceleration  9". 
This  is  then  substituted  into  eq.(17)  where  an  u^ 
date  to  the  approximate  Lagrange  multipliers  AS 
is  obtained.  This  is  then  substituted  back  into 
eq.(16)  and  the  iterative  process  continues  untU  con¬ 
vergence  is  recognized.  For  the  sake  of  brevity,  we 
only  mention  here  that,  convergence  of  the  method 
may  be  shown  (9"  —*  qj  and  AS  — >  Ao).  That  is,  the 


approximate  accelerations  and  Lagrange  multipliers 
approach  the  true  values  in  the  limit.  The  proof 
relies  on  the  fact  that  the  mass  matrix  is  positive 
definite,  e  >  0,  and  by  requiring  that  the  constraint 
jacobian  maintain  full  rank. 

Now  then,  having  converged  to  the  true  acceler¬ 
ations  and  Lagrange  multipliers  of  the  original  sys¬ 
tem,  we  next  introduce  an  approximation  to  the  ad¬ 
joint  system  as  was  done  for  the  original  system. 


d(po 

vP  =  Pi  (9i  9.  *>  A)  +  % 


dqi 


^  U.  S’* 


(18) 


7S^^ 


=  7S 

-^[djovP 


+  -b 
with  'fg  =  0. 


(19) 


Here,  u”  and  7S  represent  current  approximations 
to  the  true  accelerations  and  Lagrange  multipliers  of 
the  adjoint  system,  respectively,  wWle  the  bracketed 
term  represents  a  measure  of  constraint  violation. 
Again,  n  is  the  iteration  number,  e  >  0  is  a  small 
penalty  factor,  and  C."  >  0  represent  a  damping 
factor  and  frequency  associated  with  the  constrmnt 
violation. 

The  iterative  procedure  at  time  t  is  perforrned 
on  eqs.(18)  and  (19)  just  like  it  was  for  the  origi¬ 
nal  system.  This  iterative  scheme  is  also  convergent 
(vT*  _>  Vj  and  7"  ^  7o):  that  is  the  approximate 
adjoint  accelerations  and  associated  Lagrange  mul¬ 
tipliers  approach  their  true  values  in  the  limit. 

Thus,  careful  application  of  the  augmented  La- 
grangian  method  to  the  numerical  solution  of  the 
coupled  differential-algebraic  equations,  which  d^ 
fine  the  necessary  conditions  to  optimal  control,  is 
seen  to  be  a  suitable  and  attractive  solution  process. 


Tllnstrative  Examples 

We  now  focus  on  illustrative  examples.  The  pre 
vious  section  outlined  numerical  techniques  which 
may  be  employed  within  the  solution  process  of 
a  chosen  numerical  method  to  solving  the  two- 
point  boundary  value  problem.  For  all  the  exam¬ 
ples  below,  we  use  a  shooting  method  of  solution. 
The  results  are  obtained  through  using  the  codes 
DNEQNF  avrdlable  in  the  IMSL^^  library. 


The  first  example  is  a  two-link  rigid  manipulator 
shown  in  Fig.  1.  The  system  properties  are  listed 
in  Table  1.  In  the  simulation,  we  slew  both  links 
through  angles  of  90*  in  a  prescribed  time.  We 
enforce  that  the  controls  begin  and  end  at  zero. 
Results  are  shown  Figs,  l(a-c). 

The  second  example,  shown  in  Figure  2  repre¬ 
sents  a  free  floating  satellite.  Table  2  contains  the 
system  properties.  A  similar  system  was  presented 
in  Ref.  15.  The  system  begins  in  a  folded  up  fash¬ 
ion  and  the  optimal  control  is  found  to  rotate  the 
main  body  through  90°  while  extending  the  arms 
in  the  outreached  postion  of  90°  in  a  prescribed  fi¬ 
nal  time.  In  Ref.  15,  the  relative  angles  between 
the  bodies  are  chosen  as  the  generalized  coordinates. 
This  description  results  in  the  main  body  angle  a 
being  an  ignorable  coordinate  (a  statement  of  the 
conservation  of  angular  momentum  for  the  system). 
The  equations  of  motion  aie  put  into  a  normal  form 
via  a  feedback  transformation,  and  pseudo  control 
functions  are  sought  rather  than  the  acuator  con¬ 
trol  torques.  Here,  we  select  the  absolute  angles, 
(as  measured  from  a  reference)  as  the  generalized 
coordinates.  In  this  description,  a  is  no  longer  an 
ignorable  coordinate.  We  do  use  a  simple  stabiliza¬ 
tion  procedure^®  to  acurately  enforce  the  riprous 
integral  of  motion  (angular  momentum)  while  nu¬ 
merically  integrating  the  system  equations.  Results 
are  shown  in  Figs.  2(a-d). 

The  last  example  represents  a  holonomically  con¬ 
strained  system.  A  two-link  rigid  manipulator  sys¬ 
tem  is  constrained  to  tem^  in  contact  with  a  sur¬ 
face  (cf.  Fig.  3).  The  constraint  function  for  this 
example  is 

tp  —  lx  cos  $1  +  h  cos  02  ~  ^ 

The  system  properties  are  listed  in  Table  3.  The 
end  effector  is  moved  a  distance  along  the  surface 
in  a  prescribed  time.  The  augmented  Lagrangian 
method  presented  earlier  is  used  to  enforce  the 
constraint  and  the  results  are  shown  in  Figs.  3(a-d). 


Conclusions 

We  have  investigated  the  necessary  conditions  re¬ 
lated  to  the  optimal  control  of  natural  second  order 
systems.  These  systems  represent  a  significant  class 
of  problems  in  analytical  mechanics;  most  notably, 
robotic  and  satellite  systems  wherein  the  joint  an¬ 
gles  between  substructures  may  undergo  large  rota¬ 
tions.  We  have  presented  a  new  approach  to  op¬ 
timal  control  of  natural  systems  subject  to  holo- 
nomic  constraints.  In  this  approach,  the  differential- 
algebraic  equations  are  augmented  to  a  performance 


index  and  variational  calculus  techniques  are  used 
to  obtain  the  necessary  conditions.  Like  the  origi¬ 
nal  dynamical  system,  the  resulting  adjoint  system 
is  also  constrained.  A  careful  application  of  an  aug¬ 
mented  Lagrangian  method  is  proposed  to  enforce 
the  constraints  relationships  of  the  original  and  ad¬ 
joint  systems  during  the  numerical  solution  of  the 
two-point  boundary  value  problem.  Also,  the  dif¬ 
ferential  equations,  as  presented,  are  readily  suit¬ 
able  to  numerical  integration  by  implicit  integration 
schemes  recently  developed. 
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Table  2.  System  parameters 
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Abstract — ^We  present  some  elegant  concepts  &om  stability  theory,  and  consider 
their  applicability  to  the  problem  of  designing  control  laws  for  many  degree  of 
freedom  nonlinear  dynamical  systems.  While  the  spirit  of  our  presentation  is 
classical,  we  include  some  novel  stability  results  and  methodology  for  designing 
globally  stable  control  laws  for  nonlinear  dynamical  systems.  The  Lyapunov 
approach  is  attractive  because  it  provides  the  most  broadly  applicable  approach 
to  stability  analysis  and  guaranteed  stable  controller  design  for  nonlinear,  time 
varying,  and  distributed  parameter  systems.  Espedally  significant  is  the  fact  that 
the  Lyapunov  approach  leads  to  a  unihed  stability  and  control  perspective  for  both 
linear  and  nonlinear  ^sterns,  as  well  as  systems  described  by  ordinary,  partial,  iand 
hybrid  differential  equations.  The  first  half  of  thb  chapter  is  an  efficient  summary 
of  the  main  features  of  Lyapunov  stability  theory;  however,  a  few  examples  are 
considered  to  help  illustrate  this  material.  The  second  half  of  the  chapter  is 
addressed  to  studies  wherein  we  formulate  stabilizing  feedback  control  laws  for 
multibody  distributed  parameter  systems  undergoing  large,  generally  nonlinear 
motions.  Analytical,  numerical,  and  experimental  results  are  discussed. 


Sec.  3.1.  Basic  Definitions 


109 


3.1  BASIC  DEFINITIONS 

Consider  a  continuous,  finite-dimensional  dynami^  system  which  can  be  described 
by  a  first-order  nonlinear  vector  differential  equation  of  the  form 

x  =  f(x,t),  xeR"  (3-1) 

where  x(t)  is  the  state  vector  at  time  t.  and  the  dot  denotes  time  differentiation. 

Definition  3.1:  Equilibrium  State 

A  vector  Xe  €  R"  is  said  to  be  an  egMum  state  of  the  system  described  by 

Eo.  (3.1)  at  time  to  if  /, 

^  f(xe.t)  =  0  Vt>to  (3-2) 

If  Xe  is  an  equilibrium  state  of  Eq.  (3.1)  at  time  to,  then  Xe  is  also  ^  f-n'f 
st2  of  Eq.  (3.1)  at  all  times  ti  >  to.  In  other  words,  a  motion  initiating  exactly 

at  Xe  at  some  time,  remains  there  for  all  time. 


Definition  3.2:  Stability  of  an  Equilibrium  State 

The  equilibrium  state  Xe,  or  the  equilibrium  solution  x(t)  =  ^, 

if  for  any  given  to  and  positive  «,  there  exists  a  positive  «(«,  to)  such  that 

varying  trajectory  (or  solution)  x(t)  initiating  (time  to)  at  a  point  xq 

witSn  in  aVneighborhood  of  Xe  {IN  -  Xell  <  S.  xq  =  x(to)}  remains  for  all 

time  within  an  e-Lighborhood  of  xe  {l|x(t)  -Xell  <  «  V  t  >  to).  The  equilibrium 

state  is  said  to  be  unstable  if  it  is  not  stable. 


Definition  3.3:  Asymptotic  Stabifity  of  an  Equilibrium  State 
The  equilibrium  state  xe  is  said  to  be  asymptotically  stable,  if 

(a)  it  is  stable  (Definition  3.2),  and  if  in  addition 

(b)  for  any  to,  there  exist  a  ^i(to),  such  that 

llxo-X€ll<  ^i  implies  that  ^limx(t)-+xe  (3-3) 


If  6  and  6i  are  not  functions  of  to,  then  the  equilibrium  state  is  smd  to  be 
uniformly  sialic  and  uniformly  asympioiically  sialic,  respectively.  Defimtions  3. 
and  3.3  constitute  the  two  basic  definitions  of  stability  of  an  equilibrium  state 
(a  fixed  point  in  the  state  space)  for  an  unforced  continuous  time  system.  More 
generally,  we  need  to  consider  the  stability  of  a  trajectory  or  a  motion.  Qualitatively, 
stability  of  a  trajectory  is  concerned  with  whether  or  not  a  perturbed  motion  remains 
near  the  unperturbed  trajectory,  or  diverges  from  it.  Stability  of  a  motion  is  of 
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central  interest  in  many  practical  feedback  control  situations  whereby  a  system  is 
designed  to  execute  a  large  nominal  motion,  and  control  inputs  must  be  developed 
not  only  to  generate  the  nominal  motion,  but  also  closed  loop  feedback  is  required 
to  stabilize  neighboring  motions,  with  respect  to  the  nominal  motion,  so  that  the 
actual  system  will  behave  in  a  near-nominal  fashion. 

Definition  3.4:  Stability  of  a  Motion 

The  motion  x(t)  is  smd  to  be  stable  if,  for  all  initial  times  to  and  prescribed  positive 
€,  there  exists  a  positive  £(e,to),  such  that 

l|x(t)  —  x(t)||  <  €  V  t  >  to  if  llxQ  —  xqH  <  6 

where  x(t)  and  x(t)  are  neighboring  trajectories  with  the  pven  initial  conditions 
xq  and  xq,  respectively,  at  time  to. 

■ 

This  hounded  motion  stahiliiy  properly  is  sometimes  referred  to  as  “path  stabil¬ 
ity.”  Qualitatively,  path  stability  means  that  “if  the  perturbed  initial  state  x(to)  is 
near  x(to),  then  the  ensuing  perturbed  trajectory  x(t)  will  remain  near  x(t)  for  all 
time  t.” 

Definition  3.5:  Asymptotic  Stability  of  a  Motion 
The  motion  x(t)  is  said  to  be  asymptotically  stable  if 

(a)  it  is  stable  (Definition  3.4),  and  if  in  addition 

(b)  for  any  to,  there  e:dst  a  positive  5i(to),  such  that 

llxo-3col|<«i  implies  that  ^lim  ||x(t)  -  x(t)ll  =  0  (3.4) 

Note  that  x(t)  is  any  member  of  the  set  of  neighboring  (perturbed)  trajectories 
satisfying  Eq.  (3.4),  and  all  members  of  this  set  asymptotically  approach  x(t). 

■ 

The  above  definitions  are  not  directly  concerned  with  the  global  properties  of 
systems,  but  of  the  local  motion  in  a  finite  local  neighborhood  of  an  equilibrium 
state  or  a  motion  of  the  system  of  differenital  equations.  If  a  system  has  a  globally 
asymptotically  stable  equilibrium  state,  then  it  is  obviously  the  onlg  equilibrium 
state,  and  every  motion  converges  to  that  unique  equilibrium.  An  analogous  global 
stability  property  can  be  defined  for  the  stability  of  a  motion. 

The  simplest  class  of  Lyapunov  stability  analysis  methods  arises  in  the  context 
of  systems  described  by  linear  unforced  difTerential  equations.  We  summarize  some 
of  the  central  ideas  and  results  below. 

Consider  the  linear  system 


x(t)  =  A(t)x(t) 


0^7 
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which  obviously  has  an  equilibrium  state  at  the  origin.  This  linear  system  can  be 
dassified  as  stable,  asymptotically  stable,  or  unstable,  depending  on  the  stability 
of  the  origin  (Vidyasagar  1978],  [WiUems  1970]. 

Now,  we  introduce  two  definitions  associated  with  the  concept  of  positive  defimte 
functions,  these  are  of  central  importance  when  applying  Lyapunov  stabiUty  theory. 

Definition  3.6:  Positive  Definite  Function 

A  singled-valued  ftmction  U(x),  which  is  continuous  and  has  contmuous  partial 
derivatives  with  respect  to  the  components  of  the  vector  x,  is  sdd  to  be  positive 
definite  in  some  region  fi  about  the  origin  if  it  vanishes  at  the  origin  and  is  positive 
elsewhere,  i.c., 

(i) U(0)  =  0 

(ii)  U(x)  >  0  for  all  nonzero  x  € 


If  the  positivity  condition  (ii)  is  relaxed  to  simply  the  non-negative  condition 
U(x)  >  0  for  all  X  €  fl,  then  U(x)  is  said  to  be  positive  semidefinite.  If  the  inequality 
sign  in  (ii)  is  reversed,  then  the  condition  for  a  negative  definite  function  is  obtained. 
If  a  function  is  neither  positive  nor  negative  definite,  then  it  is  indefinite. 

Definition  3.7:  Positive  Definite  Quadratic  Forms 

In  the  analysis  of  linear  dynamical  systems,  quadratic  functions  of  the  state  vector 
arise  often  in  the  context  of  energy,  stability  and  control  analyses,  ^pecially 
important  are  symmetric  quadratic  forms.  The  quadratic  form  U(x)  =  x  Qx  said 
to  be  jposiiivt  definite  if 

U(x)  =  x'^Qx  >  0  for  all  nonzero  x  €  R” 
where  Q  is  a  real  symmetric  matrix. 

■ 

Definition  3.7  is  equivalent  to  requiring  that  all  the  eigenvalues  of  Q  are  strictly 
positive,  such  a  matrix  is  naturally  called  a  positive  definite  matrix. 

Further  discussion  of  these  concepts  is  presented  in  [Vidyasagar  1978]  and 
[Willems  1970]. 

The  following  example  illustrates  the  ideas  underlying  the  above  discussion. 

Example  3.1 
Consider  the  functions: 

Ui(x)  =  Xi  +  X2  +  x|  and  U2(x)  =  (xi +X2  +  Xg)^. 

Clearly  Ui  satisfies  the  condition  of  Definition  3.7,  therefore  it  is  a  positive  definite 
function  in  a  three-dimensional  space,  but  Ui  is  only  positive  semidefinite  if  the 
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underlying  space  has  more  thM  three  dimensions.  Uj  is  only  positive  semidefinite 
in  three  space,  since  it  is  zero  everywhere  in  the  plane  Xi  +  xj  +  X3  =  0. 


3.2  LYAPUNOV  STABILITY  THEORY  (LYAPUNOV’S  DIRECT 
METHOD) 

The  central  ideas  of  the  Lyapunov  stability  theorem  are  now  introduced.  For  a 
given  general  nonlinear,  forced,  dissipative  mechanical  syst^,  it  is  oft^  useM  to 
consider  a  conservative  idealized  approximation  of  system  without  the  dissipative  or 
nonconservative  external  forces  acting.  For  this  idealized  nonlinear  system,  suppose 
that  there  exists  one  equilibrium  state  Xe  of  the  system.  Also  suppose  that  the  to^ 
mechanical  energy  or  Hamiltonian  of  this  idealized  system  is  a  poative  definite 
function  and  is  an  exact  integral  of  the  idealized  system.  For  a  broad  class  of 
practical  applications,  the  total  energy  or  Hamiltonian  of  an  ide^ed  conservative 
system  is  a  suitable  Lyapunov  function  for  studying  the  stability  of  the  system, 
including  dissipative  internal  and  external  forces;  for  many  applications,  it  naturally 
occurs,  or  be  arranged  that  the  equilibrium  stale  is  the  target  state  for  the 
system.  More  generally,  a  candidate  Lyapunov  function  must  belong  to  a  class  of 
admissible  ‘energy’  functions  which  have  as  the  most  fundamental  property  that 
they  are  zero  at  the  equilibrium  state  and  positive  everywhere  else. 

Now  let  us  assume  that  the  system  is  initially  perturbed  to  a  state  neighboring 
the  equilibrium  point  where  the  energy  level  is  positive  by  assumption,  and  we 
consider  the  time  evolution  of  the  distance  to  the  equilibrium  as  measured  by  the 
energy  function.  Depending  on  the  nature  of  the  selected  “energy”  (Lyapunov 
function),  the  stability  of  the  motion  may  be  described  qualitatively  as  follows: 

(i)  if  the  system  dynamics  evolve  such  that  the  initial  energy  of  the  system  is  not 
increasing  with  time  for  all  starting  points  in  a  finite  neighborhood,  we  can  conclude 
that  the  equilibrium  state  b  stable, 

(ii)  if  the  system  dynamics  evolve  such  that  the  energy  of  the  system  b  monoton- 
ically  decreasing  with  time  for  all  initial  conditions  in  the  neighborhood  (and  thus 
eventually  approaches  zero),  the  equilibrium  state  b  asymptotically  stable, 

(iii)  if  the  energy  of  the  system  b  increasing  with  time,  for  any  initial  condition  in 
the  neighborhood,  then  the  equilibrium  state  b  unstable,  and 

(iv)  if  the  chosen  energy  measure  b  indefinite  (i.e.,  it  b  neither  strictly  decreasing 
nor  increasing),  then  no  conclusion  can  be  drawn  on  the  stability  of  the  system. 
The  following  theorem,  which  b  a  rigorous  statement  of  the  above  remarks,  b  the 
basic  stability  concept  underlying  Lyapunov’s  direct  (second)  method. 

Theorem  3.1:  Stability  Theorem 

The  equiUbrium  state  xe  b  stable  if  there  eodsts  a  continuously  differentiable 
function  U(x)  such  that 
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(i)  U(xe)  =  0 

(ii)  U(x)  >0  for  all  X  #  xe,  X  6 12 

(iii)  U(x)  <  C  for  all  X  xe,  X  G  n 


aw..  7  — w, 

where  U(x)  denotes  the  time  derivative  of  the  function  U(x),  and  fl  is  some  region 
containing^xe.  Notice  that  the  “energy  rate”  U(x)  is  ev^uated  along  a  iyptcc! 

>ni  the  conations  (U)  .ad  OB)  mast  Bold  dong  .11  ./ 

injectories  ofihe  dynamical  system,  which  ensue  from  imtial  states  in  0. 


A  modest  perturbation  of  Theorem  3.1  (making  the  fin^  inequahty  strict)  results 
in  the  following  theorem,  which  provides  necessary  and  suffiaent  conditions  for 
asymptotic  stahility. 

Theorem  3.2:  Asymptotic  Stability  Theorem 

The  equilibrium  state  Xe  is  asymptotically  stable  if  there  ensts  a  continuously 
differentiable  function  U  such  that 


(i)  U(xe)  =  0 

(ii)  U(x)  >0  for  all  X  #  Xe,  X  G  f2 
(ui)  U(x)  <0  for  all  X  #  Xe,  X  G  f2 


Both  of  the  previous  theorems  relate  to  local  stability  m  the  vicinity  of  the 
equilibrium  state.  A  system  has  global  asymptotic  stability  with  respect  to  a  unique 
equilibrium  point  if  the  following  theorem  is  satisfied. 

Theorem  3.3:  Global  Asymptotic  Stability  Theorem 

The  equilibrium  state  xe  is  globally  asymptotically  stable  if  there  exists  a  continu¬ 
ously  differentiable  function  U  with  the  following  properties: 

(i)  U(xe)  =  0 

(ii)  U(x)  >0  for  aU  X  #  Xe 

(iii) U(x)<0  forallx^txe 

(iv)  U(x)  — B  oo  as  11x11  — ►  oo 


Note  that  the  stable  region  J2  extends  to  infinity  in  Theorem  3.3.  The  reader  is 
referred  to  (Vidyasagar  1978]  for  further  discussion,  including  the  complete  proofs 
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of  the  above  theorems.  Observe  that  there  is  no  one  unique  Lyapunov  function  for 
a  given  system;  some  may  be  better  than  others.  This  is  especially  important  when 
we  seek  the  “least  conservative”  stability  information  when,  for  example,  we  seek 
to  determine  the  size  of  the  fi  region  in  which  we  have  stability.  If  a  poor  choice 
of  U(x)  results  in  a  pessimistic  conclusion  that  the  stable  region  fl  is  much  smaller 
than  it  actually  is,  then  this  is  an  obvious  concern.  It  also  should  be  noted  that  if  a 
Lyapunov  function  cannot  be  foimd,  nothing  can  be  concluded  about  the  stability  of 
the  system,  since  the  Lyapunov  stability  theorem  provides  ojdy  efficient  conditions 
for  stability.  Therefore,  the  conditions  required  to  prove  stability,  based  upon  an 
arbitrary  choice  of  Lyapunov  function,  may  be  very  conservative. 

Unfortunately,  the  above  classical  Lyapunov  theorems  are  not  constructive]  these 
stability  do  not  reveal  a  process  to  find  a  candidate  Lyapunov  function.  It 

is  often  difficrdt  to  find  a  suitable  Lyapunov  function  for  a  given  nonlinear  system. 
The  physical  and  mathematical  insights  of  the  analyst  have  historically  played  an 
important  role  in  most  successful  applications  of  this  approach;  however,  more 
systematic  methods  have  recently  emerged  [Oh  1991]  ,  [Junkins  1993,  1991,  1990] 
for  cert^n  classes  of  control  design  problems.  In  particular,  when  the  stability 
analysis  and  the  control  design  analysis  are  merged,  one  is  often  able  to  exploit  the 
additional  freedom  to  simultaneously  design  control  laws  and  select  a  Lyapunov 
function  which  guarantees  stability  of  the  closed-loop  (controlled)  system. 

Example  3.2 

Consider  the  system  described  by  the  nonlinear  ordinary  differential  equation 

x(t)-£x2(t)x(t)  +  x(t)  =  0 

The  objective  is  to  use  Lyapimov  analysis  to  investigate  the  stability  of  motion  near 
the  origin  for  this  system. 

Introducing  the  state  variable  representation  of  this  system  with  the  defimtions 
Xi  =  X,  X2  =  X,  we  write  the  equivalent  first-order  system 

Xl  =  X2,  X2  =  — Xi  +  CX1X2 

3 

It  is  easy  to  see  that  the  above  “oscillator  with  quadratic  damping”  has  an 
equilibrium  state  at  the  origin  (xi,X2)  =  (0,0).  Our  goal  is  to  determine  if  this 
state  is  stable.  For  this  purpose,  let  us  choose  the  simplest  candidate  Lyapunov 
function  is  2U(xi,X2)  =  xf  +  3^.  We  note  that  a  physical  motivation  for  choosing 
this  positive  definite  function  as  a  candidate  Lyapunov  function  is  that  it  is  an  exact 
(total  mechanical  energy)  integral  of  the  system,  for  c  =  0.  Clearly,  this  candidate 
function  satisfies  the  two  most  fundamental  necessary  conditions  that  U(0,0)  =  0 
and  U(xi,X2)  >  0  in  any  neighborhood  of  (0,0),  and  we  find  that 


il(xi,X2)  =  XiXi  -P  X2X2  =  X1X2  -bX2(-Xi  +  fx]x2)  =  Cx|x| 

Thus  U  is  <1  positive  definite  function  which  is  strictly  decreasing  along  all  system 
trajectories  if  c  <  0.  Therefore,  by  the  above  theorems,  the  origin  (0,  0)  is  a  globally 
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stable  equilibrium  point  for  c  =  0,  is  globaUy  asymptotically  stable  for  c  <  0,  and 
is  globally  unstable  for  c>  0.  Thus  Lyapunov  analysis  was  completely  successful  in 
establishing  the  global  stability  characteristics  of  this  system. 


Example  3.3 

Investigate  the  stability  of  the  system  of  nonlinear  differential  equations 

X1  =  Xi(xJ  +  x’  -  1)  —  X2|  X2  =  Xi  +  X2(Xi  +  Xj  -  1). 

We  try  the  candidate  Lyapunov  function  2U(xi,X2)  =x?  +x|,  which  is  an  exact 
integral  of  the  simplified  system  Xi  =  -X2 ,  X2  =  Xi .  This  choice  for  U  is  obviously  a 
positive  definite  function  having  its  global  minimun  at  the  origin.  It  is  also  obvious 
by  inspection,  that  the  origin  is  the  only  equilibrium  pomt  of  the  nonlinear  system. 
Investigating  the  energy  rate,  we  find 

U(xi,X2)  =  (xf  +  x^)(xj  +  xi  - 1). 

It  is  evident  that  U  is  negative  definite  over  the  finite  circular  region 
{(xi,X2)|  xf  +  X2  <  1},  which  includes  the  equilibrium  point  at  the  origin.  Hence, 
the  origin  (0,0)  is  an  asymptotically  stable  equilibrium  state  of  this  system.  Note 
that  all  points  within  the  unit  circle  are  asymptotically  attracted  to  the  origin. 
However,  because  U  is  not  a  negative  definite  function  over  all  of  iJ",  we  cannot 
conclude  global  asymptotic  stability  without  more  information.  While  we  are  cer¬ 
tain  we  have  stability  within  the  unit  circle,  this  conclusion  results  from  a  particular 
choice  of  U(xi,X2),  and  without  further  analysis,  we  cannot  conclude  that  the  sta¬ 
ble  region  is  not  actually  larger  than  the  unit  circle.  However,  since  U  is  positive 
everywhere  outside  the  unit  circle,  we  conclude,  using  the  following  Theorem  3.4, 
that  Toe  have  insiabilHy  for  all  irajeciories  which  initiate  outside  the  unit  circle  and 
asymptotic  stability  for  all  trajectories  initiating  inside  the  unit  circle.  Thus,  we 
are  able  to  use  the  stability  and  instability  insights  simultaneously  to  “establish 
the  complete  story”  vis-a-vis  the  global  stability  properties  of  this  system,  since 
the  stable  and  unstable  regions  have  a  mutual  boundary  and  together  the  stable  and 
unstable  regions  span  all  of  state  space  R^. 


The  following  theorem  is  sometimes  usefiil  in  avoiding  a  fruitless  search  for 
Lyapunov  functions  for  systems  which  are  inherently  unstable  in  certain  regions 
of  state  space.  This  theorem  is  also  useful  in  obtaining  theoretical  closure  of 
the  stability  analysis,  in  the  sense  that  it  is  sometimes  possible  simultaneously  to 
apply  the  instability  theorem  with  the  stability  theorems  to  establish  conclusively 
a  particular  system’s  global  stability  properties.  In  Example  3.3,  for  example,  we 
concluded  that  our  simple  choice  on  U  gave  us  all  of  the  stability  information  (<.e., 
the  system  is  stable  only  within  the  unit  circle). 
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Theorem  3.4:  Instability  Theorem 

The  equilibrium  state  xe  is  unstable  in  Q  if  there  exists  a  continuously  differentiable 
function  U  such  that 

(i)  U(xe)  =  0  and  U(xe)  =  0 

(ii)  U(x)  >0  for  all  X  Xe,  X  €  fi 

(iii)  and  there  costs  points  x  arbitrarily' close  to  Xe  such  that  U(xe)  >  0 


If  one  can  find  any  fimction  U  satisfying  the  above  conditions,  then  xe  is  a 
completely  unstable  equilibrium  point  in  and  the  quest  for  Lyapunov  functions 
can  be  halted.  In  Example  3.3,  the  Q  for  the  instability  theorem  is  clearly  the 
compliment  of  the  fi  for  the  asymptotically  stable  region,  and  it  is  apparent  that 
ihe  stable  and  unstable  regions  being  complimentary,  (together  spanning  all  of  state 
space)  is  the  key  to  establishing  global  stability/instability  information. 

3.3  STABILITY  OF  LINEAR  SYSTEMS 
3.3.1  Lyapunov  Theorem  for  Linear  Systems 

Lyapunov’s  method  is  easily  applied  to  test  the  stability  of  a  linear  system.  Consider 
an  autonomous  system  described  by  the  linear  vector  differential  equation 

x(t)  =  Ax(t)  (3.5) 

The  above  ^stem  is  said  to  be  stable  in  the  sense  of  Lyapunov,  if  the  solution  of 
Eq.  (3.5)  tends  toward  zero  (which  is  obviously  the  only  equilibrium  state  if  A  is  of 
full  rank)  as  t  — ►  oo  for  arbitrary  initial  condition. 

Consider  the  case  of  a  constant  A  matrix.  If  all  eigenvalues  of  A  are  distinct, 
the  response  of  system  (3.5)  due  to  initial  condition  xq  can  be  written  as 

3'(t)  =Esfxoe^‘*^  (3.6) 

where  A|  axe  the  eigenvalues  of  A,  and  arcj  respectively,  the  right  and  left 
eigenvectors  of  A  associated  with  Aj.  ror  the  repeated  eigenvalue  case,  the  situation 
is  more  complicated  (i.c.,  we  should  solve  for  the  generalized  eigenvectors  of  A).  The 
generalization  of  Eq.  (3.6)  for  the  case  of  generalized  eigenvectors  has  a  similar  form, 
but  is  not  discussed  here  [Chen  1984].  From  Eq.  (3.6),  we  can  see  by  inspection 
that  the  system  is  asymptotically  stable  if  and  only  if  all  the  eigenvalues  of  A  have 
negative  real  parts,  i.e., 

S[Ai(A)]<0  .  (3.7) 
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Thus,  we  have  the  well  known  result  that  the  stability  of  a  linear  constant- 
coefficient  dynamical  system  can  be  completely  characterized  by  the  signs  of  the 
real  parts  of  the  eigenvalues  of  the  system.  This  approach  to  stability  analy^  yields 
both  necessary  and  sufficient  conditions.  However,  calculatmg  all  the  eigenvalues  of 
the  system  matrix  is  not  always  desirable,  especially  for  high-dimensioned  ^sterns. 
As  will  be  evident  below,  other  stability  viewpomts  lead  to  important  insights 
and  generalized  methods,  especially  vis-a-vis  stability  analysis  for  time-varying, 
distributed-parameter,  and  nonlinear  systems. 

For  the  linear  dynamical  system  of  Eq.  (3.5),  we  choose  a  symmetric  quadratic 
form  as  a  candidate  Lyapunov  function 

2U(x)  =  x^'Px  (3.8) 

where  P  is  a  positive  definite,  real  symmetric  matrix.  Thus  U  is  p^itiye  definite 
with  it’s  global  minimun  at  the  origin,  which  is  obviously  an  eqi^brium  sUte. 
Differentiating  Eq.  (3.8)  and  substituting  Eq.  (3.5)  into  the  result  gives 

U(x)  =  x‘^^(A‘^P-^PA)x.  ;  (3.9) 

Using  the  Lyapunov  stability  Theorem  3.2,  we  require  U(x)  to  be  negative  definite. 
We  can  rewrite  the  energy  rate  of  EJq.  (3.9)  as 

U(x)  =  -x'^Qx.  (3.10) 

So  we  see  that,  for  asymptotic  stability,  P  and  Q  must  be  positive  definite  matrices 
which  satisfy  the  condition 

ATp-j-PA  =  -Q.  (3.11) 

Equation  (3.11)  is  commonly  known  as  the  algebraic  Lyapunov  equation. 

To  examine  the  stability  of  a  liT»»ar  system  via  the  above  Lyapunov  approach  we 
can  proceed  as  follows:  **Choose  Q  to  be  any  positive  definite  matrix  for  a  ^ven  A, 
and  check  the  eigenvalues  of  the  resulting  P  which  we  obtain  by  solving  Eq.  (3.11), 
if  P  is  positive  definite  (a//  positive  eigenvalues),  the  given  system  is  asymptotically 
stable,  while  if  P  has  any  negative  eigenvalues,  the  system  is  unstable.”  One  of  the 
potential  difficulties  with  selecting  Q  and  solving  the  Lyapunov  equation  (which,  of 
course,  depends  on  the  system  matrix  A)  is  the  uniqueness  of  the  resulting  solution 
for  P.  The  following  theorem  gives  the  necessary  and  sufficient  conditions  for  the 
Lyapunov  EJq.  (3.11)  to  have  a  unique  solution. 

Theorem  3.5 

If  {Ai,...,A„}  are  the  eigenvalues  of  the  system  matrix  A,  then  the  Lyapunov 
equation  (Eq.  (3.11)]  has  a  unique  solution  P  if  and  only  if 

Ai  +  Afy^O,  i,j  =  l,...,Ti 

where  (  )^  denotes  complex  conjugate.  ■ 
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The  reader  is  referred  to  [Chen  1984]  for  a  proof  of  the  above  theorem.  Thus, 
we  cannot  solve  the  Lyapunov  equation  for  undamped  second-order  systems  having 
pairs  of  eigenvalues  on  the  imaginary  axis  (including  rigid  body  modes,  whose 
eigenvalues  reside  at  the  origin  of  the  complex  plane),  and  so  stability  analysis 
for  systems  having  a  neutrally  stable  subspace  cannot  be  completed  via  solution  of 
an  algebraic  Lyapunov  equation. 

Theorem  3.6:  Lyapunov  Stability  Theorem  for  Linear  Systems 

A  linear  system  is  asymptotically  stable  or,  equivalently,  all  the  eigenvalues  of  A 
have  negative  real  parts,  if  and  only  if  for  any  ^ven  positive  definite  symmetric 
matrix  Q  there  exists  a  positive  definite  (symmetric)  matrix  P  that  satisfies  the 
Lyapunov  equation 

A'fp  +  PA  =  -Q  (3.12) 

■ 

The  proof  of  this  theorem  is  given  in  [J unkins  1993].  Note  that  the  Lyapunov 
equation  is  equivalent  to  a  set  of  n(n  +  l)/2  linear  equations  in  n(n  +  l)/2 
unknowns  for  an  n-ih  order  system.  The  Lyapunov  equation  can  be  solved  by  using 
numerical  algorithms  utilizing  QR  factorization,  Schur  decomposition,  or  spectral 
decomposition;  however,  our  experience  indicates  that  the  most  efficient  and  robust 
algorithms  utilize  the  QR  factorization  [Junkins  1993], 

Example  3.4 

Consider  the  system  matrix 


The  simplest  choice  of  Q  is  the  identity  matrix  or  some  other  diagonal  matrix;  we 
take  Q  =  I  for  this  example,  and  let  the  three  distinct  elements  in  P  be  denoted 


Substituting  this  A  and  P  into  the  Lyapunov  equation  [Eq.  (3.11)]  yields  the 
following  three  linear  algebraic  equations 

-4pi-2p2  =  -1 

Pi  —  P2  —  P3  =  0 

2p2  +  2p3  =:  —1. 


The  solution  of  these  three  equations  is  straightforward;  we  find 


z/r 

-2 
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Even  though  we  have  a  unique  solution,  the  resulting  matrix  P  is  not  positive 
definite.  Hence,  we  conclude  that  the  system  is  unstable,  and  implicitly,  that  not 
all  of  the  eigenvalues  of  A  have  negative  real  parts.  We  would  have  to  calculate  the 
eigenvalues  to  make  further  assessments  of  eigenvalue  placement. 

■ 

In  the  case  of  a  linear  time-varying  system  x(t)  =  A(t)x(t),  the  sufiBcient 
conditions  for  the  stability  of  the  equilibrium  state  can  be  analyzed  based  on  the 
concept  of  matrix  measure  [Vidyasagar  1978],  and  if  the  system  is  asymptotically 
stable,  then  a  quadratic  Lyapunov  function  exists  for  this  system.  Of  course, 
conventional  eigenvalue  analysis  is  not  applicable  to  the  time-var^g  case  and, 
therefore,  the  more  general  Lyapunov  approach  provides  one  possible  avenue  to 
characterize  the  stability  of  nonautonomous  systems. 

3.3.2  Linear  Dynamic  Systems  Subject  to  Arbitrary  Disturbances 

To  make  the  Lyapunov  stability  analysis  in  this  section  more  complete,  we  briefly 
discuss  stability  in  the  presence  of  disturbances.  We  consider  the  class  of  systems 
described  by  the  matrix  diflerential  equation 

x(t)  =  Ax(t)-hf(t,x(t))  (3.13) 

where  the  uncertainty  and/or  perturbations  of  the  system  are  assumed  representable 
by  arbitrary  nonlinear  function  f(t,x(t))  (except  we  require  f(t,0)  =  0,  so  that  the 
origin  of  the  state  space  remains  an  equilibrium  state  for  this  class  of  model  errors 
or  disturbances).  Furthermore,  we  assume  that  exact  expressions  for  f(t,x(t))  are 
unknown  and  only  bounds  on  f (t,  x(t))  are  known.  The  central  question  we  address 
here  is  the  following;  “Given  that  A  is  asymptotically  stable,  and  without  using 
spedfic  knowledge  of  f  (t,  x(t)),  is  it  possible  to  obtmn  a  bound  on  all  f  (t,  x(t))  such 
that  the  system  maintains  its  stability?"  Put  another  way,  can  we  determine  some 
measure  of  how  large  f(t,x(t))  can  be  without  destabilizing  a  given  stable  linear 
system?  Some  insights  on  these  issues  are  embodied  in  the  following  theorem: 


Theorem  3.7  [Patel  1980] 

Suppose  that  the  system  of  Eq.  (3.13)  is  asymptotically  stable  for  f  (t,  x(t))  =  0,  then 
the  system  remains  asymptotically  stable  for  all  nonzero  perturbations  f(t,x(t)) 
which  are  suffidently  small  that  they  satisfy  the  following  inequality 


llfll  ,minA(Q)_ 

11x11 -max  A(P)-^'^ 


(3.14) 


where  P  and  Q  satisfies  the  following  Lyapunov  equation 


A'rp-bPA  =  -2Q 


■ 


and  where  the  otherwise  arbitrary  f(t,x(t))  vanishes  at  the  origin  f(t,0)  =  0. 


120 


Stability  and  Control  of  Nonlinear  Mechanical  Systems  Ch.  3 


The  proof  of  this  theorem  is  given  in  [Patel  1980],  [Junkins  1993].  Since  P  is  a 
positive  definite  matrix,  the  maximum  eigenvalue  of  P  is  same  as  the  largest  singular 
value  of  P.  It  has  been  also  shown  in  [Patel  1980]  that  when  the  identity  matrix 
is  chosen  for  Q,  Hrr  in  Eq.  (3.14)  is  a  maximum  and  for  this  choice,  can  be 


expressed  as  ^ 

^"^max  A(P)  ^  <7„ax(P)  ■ 


(3.15) 


The  above  bound  b  often  very  conservative,  since  it  b  only  a  sufficient  condition 
for  the  stability  of  the  system,  and  thb  stringent  bound  b  not  usually  necessary. 
An  important  spedal  case  b  for  the  class  of  perturbations  having  the  Unear 


structure 

f(t.x(t))  =  Ex(t) 


(3.16) 


Clearly  thb  corresponds  to  an  additive  error  in  the  A  matrix  (i.e.,  A  — ►  A  +  E). 
We  can  apply  Theorem  3.7  to  arrive  at  the  desired  result;  we  can  establbh  that  the 
system  remrins  stable  if  E  b  bounded  by  the  foUowing  modified  stability  margim 


I|E11< 


min  [-3i{Ai(A)}] 

/:(<>) 


(3.17) 


where  K{^)  is  the  condition  number  of  and  is  the  normalized  eigenvector 
(modal)  matrix  of  A.  The  condition  number  definition  used  here  is  the  ratio  of  the 
largest  and  least  singular  values  of 


x:($)  = 


O’max(^) 


As  is  evident  in  the  above  discussion,  the  stability  margin  is  closely  related  to 
the  Patel-Toda  robustness  margin;  the  “more  stable”  the  nominal  system  is,  the 
larger  the  boimd  on  the  allowable  perturbation  E  becomes.  However,  the  important 
ingredient  evident  in  Eq.  (3.17)  is  the  fact  that  a  large  condition  number  /C($) 
degrades  the  effective  stability  margin.  Qualitatively,  if  the  eigensolution  is  highly 
sensitive  (large  condition  number),  then  it  is  easier  to  introduce  destabilizing 
perturbations,  and  generally,  the  stability  margin  (distance  of  eigenvalues  from  the 
imaginary  axis)  should  be  considered  simultaneously  with  a  measure  of  sensitivity. 
The  intimate  connection  of  the  Patel-Toda  robustness  measure  (for  stability  of  linear 
dynamical  systems  in  the  presence  of  additive  perturbations)  to  the  Bauer-Fike 
Theorem  (for  conditioning  of  the  algebraic  eigenvalue  problem  [Junkins  1993]) 
is  clear. 

Note  that  the  condition  number  /C($)  approaches  its  smallest  possible  value  of 
unity  if  $  is  any  unitary  matrix  (one  for  which  =  I),  and  the  upper  bound  on 
the  condition  number  is  infinity  which  occurs  if  ^  is  any  singular  matrix.  Observe 
that  am  infinity  of  unitary  matrices  e^dst,  some  of  them  are  “closer”  to  ^  than 
others.  When  one  has  the  freedom  to  modify  A  (and  therefore  $),  a  natural  question 
arises:  for  a  given  class  of  A-modifications,  how  can  we  make  $  as  nearly  unitary 
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as  possible?  Of  course,  one  way  to  modify  the  A  matrix  is  through  design  of  a 
feedback  controller,  and  one  avenue  toward  designing  gans  in  linear  robust  control 
laws  is  to  maximke  the  right-hand  side  of  Eq.  (3.17)  by  mmimizing  K(^).  It  k  al^ 
of  significance  that  choice  of  actuator  locations  considered  simultaneously  with  the 
design  of  control  gains  can  often  significantly  reduce  the  condition  number 
These  ideas  provide  some  of  the  motivation  for  the  robust  eigcnstrueiurc  algontkms 
and  actuator  placement  optimization  approaches  presented  in  [Junkms  1993J. 

3.4  NONLINEAR  AND  TIME  VARYING 
DYNAMICAL  SYSTEMS 

In  this  section,  we  present  stability  analysis  methods  for  nonlinear  systems.  ^ 
section  3.4.1,  we  conader  a  method  known  as  Lyapunov’s  indirect  (or  first)  method, 
whereby  we  can  determine  partial  stability  information  for  nonlinear  systems  by 
.^.mining  the  behavior  of  locally  linearized  systems.  In  section  3.4.2,  we  develop  an 
important  result  which  provides  easy-to-test  sufficient  conditions  to  determine  if  we 
have  asymptotic  stability  in  spite  of  the  common  situation  that  the  energy  function’s 
time  derivative  is  only  a  negative  semidefinite  function  of  the  state  variables.  In 
addition  to  the  classical  stabUity  analysis  for  whiA  the  Lyapunov  methods  were 
developed,  these  ideas  can  be  used  to  motivate  design  methods  which  yield  control 
laws  for  control  of  large  maneuvers  for  distributed-parameter  systems. 

This  approach  is  used  throughout  the  remainder  of  this  chapter.  In  section 
3.5,  we  consider  a  nonlinear  multibody  idealization  of  two  robots  cooperatively 
manipulating  a  payload.  Both  open-loop  and  feedback-control  designs  are  studied, 
and  Lyapunov  methods  are  used  to  ensure  path  stability  of  the  resulting  closed-loop 
dynamics,  using  a  tracking  control  law. 

3.4.1  Local  Stability  of  Linearized  Systems 

Stability  analysis  of  linear  motion  arises  often  in  practical  analysis  of  nonlinear 
systems  when  we  are  concerned  with  motion  near  an  equilibrium  state.  The  results 
presented  in  section  3.3.1  enable  us  to  obtain  necessary  and  sufficient  conditions 
for  the  stability  of  linear  systems,  but  also  provide  us  a  method  for  determining 
the  local  stability  of  a  nonlinear  system  by  linearization,  which  is  called  Lyapunov  s 
indirect  method. 

Consider  the  autonomous  system 

x(t)  =  f [x(t)]  with  f(xe)  =  0  (3.18) 

Let  z(t)  be  the  perturbation  (departure  motion)  from  the  equilibrium  state  as 

x(t)  =  xe  +  z(t)  (3.19) 

Using  Taylor’s  series  expansion  of  f(-)  around  the  equilibrium  state  xei  we  can  write 

tl,(l)  +  xel  =  f(*.)  +  +  0W‘>1=  (3-20) 
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Using  Eq.  (3.20)  in  Eq.  (3.18)  gives  the  perturbation  equation 

i(t)  =  Aa!(t)  +  0[z{t)]^  (3.21) 

where  A  denotes  the  Jacobian  matruc  of  f  evaluated  at  x  =  Xe.  A  =  [l^] 

l®*Jx=Xe 

and  so  we  find  the  linear,  constant  coefficient  matrix  differential  equation 

i(t)  =  Az(t)  (3.22) 

The  following  theorem  is  given  here  (without  proof);  this  is  the  main  stabUity 
result  of  Lyapunov’s  indirect  method. 

Theorem  3.8:  Lyapunov's  indirect  Method 

If  the  linearized  system  (Eq.  (3.22))  is  asymptotically  stable,  then  the  original 
nonlmear  system  [Eq.  (3.18)]  is  also  asymptotically  stable  if  the  motion  initiates 
in  a  sufBciently  small  neighborhood  containing  the  equilibrium  state. 

■ 

The  above  theorem  is  useful  since  we  can  analyze  the  local  stability  of  an 
equilibrium  state  of  a  given  nonlinear  system  by  examining  a  linear  system. 
However,  the  conclusions  based  on  linearizations  are  local,  and  therefore  to  study 
global  stability,  we  should  rely  on  Lyapunov’s  direct  method.  On  the  other 
hand,  if  one  can  find  all  equilibrium  points  and  investigate  their  local  stability, 
a  farly  complete  picture  of  the  overall  global  stability  characteristics  can  often 
be  derived.  Note  that  one  key  shortcoming  (of  the  indirect  approach)  b  the 
absence  of  information  on  the  size  or  boundary  of  the  “dom^  of  attraction”  of 
each  locally  stable  equilibrium  point;  thb  b  precisely  the  information  which  a 
completely  successful  application  of  the  direct  approach  determines.  Finally,  we 
note  the  most  important  point:  if  the  linear  motion  is  critical  (e.g.,  zero  damping, 
some  eigenvalues  have  zero  real  parts),  then  the  stability  of  the  locally  linearized 
analysis  should  be  considered  inconclusive  and  nonlinear  effects  must  be  included  to 
conclude  local  stability  or  instability. 

3.4.2  What  to  Do  When  U  is  Negative  Semidefinite? 

Several  subtle  possibilities  arise  if  the  function  derived  for  U  b  not  negative  definite. 
For  a  significant  fraction  of  the  practical  occurrences  of  thb  condition,  including 
several  applications  considered  subsequently  in  thb  chapter,  we  can  prove  global 
asymptotic  stability  in  spite  of  the  fact  that  the  function  derived  for  U  is  negative 
semidefinite.  The  mmn  results  from  the  traditional  literature  for  dealing  with  thb 
problem  are  embodied  in  a  theorem  due  to  [LaSalle  1961);  thb  theorem  sometimes 
allows  us  to  conclude  that  we  have  local  asymptotic  stability  for  the  case  that  U  >  0 
and  U  <  0,  provided  we  can  prove  that  the  equilibrium  point  b  contained  in  a  region 
of  state  space  known  as  the  maximum  invariant  subspace  M. 
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The  maximum  invariant  subspace  is,  essentially,  the  largest  domain  M  containing 
an  equilibrium  point,  for  which  all  trajectories  evolve  such  that  U  >  0  and  U  ^  0  for 
all  time  along  the  trajectories,  with  U  =  0  being  approached  only  occ^ionally  (at 
most)  at  isolated  apogee-like  states  that  are  not  equilibrium  poinU  («-«-.  U  is  negative 
almost  everywhere  except  its  asymptotic  approach  to  zero  at  the  equihbrium  state 

which  is  a  minimum  of  U).  .....  ^  u*  u 

It  is  usually  easy  to  identify  the  subset  Z  of  points  m  the  state  space  for  which 

U  =  0  but  LaSalle’s  maximum  invariant  subspace  M  is,  in  general,  a  subset  of 
Z  The*  TT>a«"  challenge  of  applying  LaSalle’s  theorem  then  reduces  to  the  quest 
to  identify  or  approximate  M;  this  is  difficult  when  the  differential  equations  are 
compUcated  nonlinear  functions.  While  these  ideas  are  elegant,  we  dect  not  to 
discuss  the  search  for  M  in  detail,  but  rather  we  present  a  recently  developed  result 
[Mukherjee  1992a,  1992b,  1993],  [Junkins  1993]  which  is  often  easier  to  apply. 

Prior  to  stating  the  theorem,  we  introduce  some  notations:^  Let  x  —  0  be  an 
equilibrium  state  of  the  nonlinear  system  x  =  f  (t,  x),  where  f  is  a  smooth,  twice 
differentiable  n-vector  function  oft  and  x.  Note  that  the  trajectories  of  the  nonhneM 
differential  equation  x  =  f(t,x)  generates  a  smooth  vector  field  in  the  re^on  fi 
which  includes  x  =  0.  Let  U(t,x)  be  a  scalar  analytic  function  in  Q,  which  is  locally 
positive  definite.  Suppose  tl(t,x)  is  only  negative  semidefinite.  Let  Z  denote  the 
set  of  points  for  which  U(t,x)  vanishes.  We  wiU  be  concerned  with  the  first  k 
derivatives  evaluated  on  the  set  Z.  We  are  now  prepared  to  state  the  theorem: 


Theorem  3.9 

A  sufficient  condition  for  asymptotic  stability,  when  U  >  0  and  U  <  0  for  all  x  € 
is  that  the  first  (k-1)  derivatives  of  U  vanish  on  Z,  up  through  some  even  order  (k-1) 

^  =  0,  V  x€Z,  for  j  =  l,2,...,  k-1  (3.23) 

dti 

and  the  first  (the  kth)  nonzero  derivative  of  U  (evaluated  on  Z)  is  of  odd  order  and 
is  negative  definite  for  all  points  on  Z; 


— ^  <0,  V  X  €  Z,  for  k  odd  (3-24) 

dt* 

In  the  event  that  all  infinity  of  U  derivatives  vanish  on  Z,  sufficient  conffitions  for 
stability  are  that  U  is  positive  definite  and  that  x  =  0  is  the  only  equilibrium  point. 


The  proof  of  this  theorem  is  given  in  [Mukherjee  1992a,b].  As  evident  below, 
this  theorem  is  easy  to  apply  to  nonlinear  and  distributed  parameter  systems.  In 
the  following  example,  it  is  also  shown  to  be  useful  for  determining  the  stability  of 
time  varying  systems. 
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[Mukherjee  19d2a] 

Consider  the  damped  Mathieu  equation:  xi=X2,  X2  =  — X2  —  (2  +  sint)xi. 
We  select  the  candidate  Lyapunov  function:  U(t,xi,X2)  =  Xi’  +  J •  ^W^h 
we  observe  is  positive  definite  and  analytic  for  all  (t,xi,X2).  Upon  differentiation 
of  U,  and  substitution  of  the  equations  of  motion,  we  find  that 


U(x) 


-xr’gW. 


where 


_/x\  4  +  2  sint  +  cost 

(2  +  sint)2 


Even  though  U(x)  is  nonpositive,  since  U(x)  does  not  depend  upon  Xi,  it  is  obviously 
not  negative-definite  and  without  further  analysis,  we  can  only  conclude  mere 
siabUiiy;  however,  we’d  like  to  make  a  stronger  statement  and  conclude  asymptotic 
stability.  This  can  be  done  by  considering  the  applicability  of  Theorem  3.9.  Note 
that  the  set  Z  of  points  for  which  U(x)  vanishes  is  the  set  of  all  real  values  for  x^, 
and  zero  values  for  X2.  Upon  taking  the  second  and  third  derivatives  of  U,  and 
evaluating  them  on  Z,  we  find  that 


-^  =  0,  and  =  -2(2+sint)Vt)xa*,  V  x€Z 

Since  the  second  derivative  of  U  vanishes  on  Z  and  the  third  derivative  is  negative 
on  Z,  except  at  the  origin,  we  conclude  that  all  of  the  conditions  of  Theorem  3.9 
are  satisfied;  indeed  this  system  is  proven  globally  asymptotically  stable. 


3.4.3  Lyapunov  Control  Law  Design  Method 

Here,  we  present  a  method  for  generating  globally  stable  feedback  control  laws  for 
maneuvers  of  nonlineair  systems  and  distributed  parameter  systems.  A  Lyapunov 
function  is  selected  which  is  conserved  for  the  uncontrolled  system.  Then  when 
the  control  u(t)  0  is  considered,  U(x)  depends  upon  u(t)  through  the  equations 
of  motion-  One  strategy  is  to  select  the  control  function  u(t,x)  (from  a  set  of 
admissible  controls)  to  make  U(x)  as  negative  as  possible;  this  Lyapunov  Optimal 
control  strategy  ensures  that  U(x)  will  locally  approach  zero  as  fast  as  possible. 
On  the  other  hand,  any  control  law  which  makes  U(x)  negative  is  asymptotically 
stabilizing,  and  in  many  instances,  it  will  be  seen  that  very  simple,  yet  globally 
stable  control  laws  can  be  determined  which  are  attractive  for  applications. 

We  will  use  specific  dynamical  systems  to  introduce  Lyapunov  control  design 
methods  for  nonlinear  and  distributed-paLrameter  systems,  A  useful  viewpoint  is  to 
consider  simultaneously  U(x)  and  u(t,  x)  “available  for  selection”  in  the  design  pro¬ 
cess;  the  class  of  problems  for  which  globally  stable  feedback  laws  can  be  obtained 
is  surprisingly  large.  There  is  coupling  between  the  selection  of  the  Lyapunov  func¬ 
tion  and  the  corresponding  stabilizing  control  laws.  We  place  the  initial  emphasis 
upon  using  work/energy  methods  together  with  stability  theory  to  determine  the 
structure  of  a  stabilizing  feedback  law  and  thereby  parameterize  an  infinite  family 
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of  stable  controllers.  Conventional  nonlinear  programmmg  algorithms  can  then  be 
invoked  to  optimize  some  specified  closed  loop  performance  criterion  over  the  sta¬ 
ble  set.  This  gives  rise  to  “Lyapunov  optimal”  control.  Although  we  subsequently 
develop  methods  for  controlling  multi-body  manipulators,  and  for  distributed  pa¬ 
rameter  systems  governed  by  hybrid  coupled  sets  of  ordinary  and  partial  differential 
equations,  we  first  consider  a  system  described  by  a  6-ih  order  set  of  nonlinear,  or¬ 
dinary  differential  equations. 

Example  3.6  Large  Angle  Rigid-Body  Maneuvers 

Some  key  ideas  are  easily  introduced  by  considering  general  three  dimensional 
nonlinear  maneuvers  of  a  single  rigid  body.  The  equations  governing  large  motion 
can  be  vrritten  as  [Junkins  1986] 

=  (I2  —  l3)a;2W3  + 

I2U2  =  (I3  +  ^2 

I3W3  =  (Ii  —  +  ^3 

2qi  =  wx  —  U2<iz  +  ti;3q2  +  +  <12^2  +  ^3^3)  (3.25) 

2q2  =  a;2  —  +  t^iqa  +  q2(<lit*^i  +  ^7^2  +  <l3t^3) 

2q3  =  ^3  —  ti;iq2  +  +  ^2^2  +  ^3^3) 


where  (wi,(J2ft^3)  and  (qi,q2iq3)  are  the  principal  axis  components  of  angular 
velocity  and  the  Euler-Rodriguez  parameters  (“Gibbs  vector”),  respectively.  Note 
that  (Ii ,  I2, 13)  and  (ui ,  U2,  U3)  are  the  principal  moments  of  inertia  and  the  principal 
axis  components  of  the  external  control  torque,  respectively. 

For  the  case  of  zero  control  torque,  it  can  be  readily  verified  that  total  rotational 
kinetic  energy  is  an  exact  integral  of  the  motion  described  by  differential  Eq.  (3.25), 
viz.,  2T  =  (Iic*^i  +  Motivated  by  the  this  total  system  energy  integral, 

we  investigate  the  trial  Lyapunov  function 

U  =  +  I2W|  +  l3w|)  +  ko(q|  +  ql  +  q|) 

=  kinetic  energy  +  ko  tan^  ^  ^  ^ 

where  4>  is  the  instantaneous  principal  roiaiion  angle  (about  the  instantaneous 
Eulerian  principal  rotation  axis,  from  the  current  angular  position  to  the  desired 
final  angular  position  of  the  body  [Junkins  1986].  It  is  apparent  that  the  additive 
term  ko(qi  +  ql  +  ^)  viewed  as  the  potential  energy  stored  in  a  conservative 

spring,  and  as  will  be  evident  below,  this  is  just  the  most  obvious  choice  for  a 
positive  measure  of  departure  from  the  orientation  qi  =  0,  q2  =  0,  qa  =  0)  .  We 
can  anticipate  that  the  system  dynamics  will  evolve  such  that  U  is  constant  if  the 
only  external  torque  is  the  associated  conservative  moment.  Of  course,  we  are 
not  interested  in  preserving  U  as  a  constant,  but  rather  we  seek  to  drive  it  to 
zero,  because  it  measmes  the  departure  of  the  system  from  the  desired  equilibrium 
state  at  the  origin.  We  further  anticipate  the  necessity  to  determine  an  additional 


126 


Stability  and  Control  of  Nonlinear  Mechanical  Systems  Ch.  3 


judicious  control  moment  to  guarantee  that  U  is  a  decreasing  function  of  time.  It 
is  obvious  by  inspection  that  U  is  positive  definite  and  vanishes  only  at  the  desired 
state  qi  =  w,  =  0.  Differentiation  of  Eq.  (3.26)  and  substitution  of  Eqs.  (3.25)  lead 
directly  to  the  following  (*^ower”)  expression  for  U: 

3 

U  =  5];a;i[ui  +  koqi(l  +  q?+qi  +  ql)]  (3.27) 

Of  all  of  the  infinity  of  possible  control  laws,  we  can  see  that  any  control  u, 
that  reduces  the  bracketed  terms  to  a  function  whose  sign  is  opposite  to  ui  will 
guarantee  that  V  is  ‘globally  negative  semudefinite.  The  simplest  choice  consists  of 
the  following;  Select  uj  so  that  i-th  bracketed  term  becomes  — kiWi.  This  gives  the 
control  law 


Ui  =  -  [kiWi  +  koqi(l  +  q?  +  qi  +  ql)]  i  i  =  1, 2, 3  (3.28) 


The  closed  loop  equations  of  motion  are  obtained  by  substitution  of  the  control 
law  of  Eq.  (3.28)  into  the  equations  of  motion  of  Eq.  (3.25)  to  establish 


Iicii 

I2W2 


=  (I2  l3)u^2^3  ^ 

=  (I3  -  Il)W3Wi  « 
=  (Ii  —  l2)wia;2  — 


kiOJi  +  koqi(l  +  q?  +  ql  +  <ll)] 
k2a;2  +  koq2(l  +  q?  +  ql  +  ql) 

’k3W3  +  koq3(l  +  qi  +  ql  +  q|)^ 


(3.29) 


Since  U  =  — (kiw^  +  k2ti;2  +  k3a;|)  does  not  depend  upon  the  q’s,  it  is  only  a 
negative  semi-definite  function,  and  while  we  have  stability^  if  we  choose  all  kj  >  0, 
we  cannot  immediately  conclude  that  we  have  asymptotic  stability.  We  can  prove 
that  we  do  indeed  have  asymptotic  stability,  for  illumination  we  estabilish  this  truth 
by  two  logical  paths. 

Path  1:  This  analysis  is  physically  motivated,  we  try  to  see  if  there  is  some 
equilibrium  point  or  trajectory  other  than  the  target  state  (the  origin)  where 
the  system  can  get  “stuck”  with  U(x)  =  0.  We  directly  investigate  the  above 
three  closed  loop  equations  of  motion  (Eqs.  (3.29)]  for.  the  existence  of  equilibrium 
points  in  these  nonlinear  closed  loop  equations  of  motion.  It  can  be  verified 
that  (qi,q2iq3iWi,<<^2iW3)  =  (0,0, 0,0, 0,0)  is  the  only  cqmlibrium  state  where 
all  velocity  and  acceleration  coordinates  vanish.  In  fact,  imposing  the  conditions 
(tJi,(I;2,W3)  =  (0,0,0)  and  (a;i,W2,W3)  =  (0,0,0)  on  the  above  closed  loop  equations 
of  motion  immediately  gives  the  requirement  that  the  q’s  satisfy  the  three  equations 


0  =  -[koqi(l  +  qx  +  q2  +  ql)],  fori  =  1,2,3 

and  it  is  obvious  by  inspection  that  these  three  nonlinear  equations  are  simultane¬ 
ously  satisfied  only  at  the  origin. 

Since  we  have  shown  that  (tii  0,(J2  7^  0,^3  -jk  0),  for  (qi  ^  0,  q2  7^  0,  q3  ^  0), 
everywhere  except  the  origin  x  =  (qi,q2,q3,<*'i,««^2,W3)*^  =  (O.O.O.O.O.O)*^,  we 
conclude  that  U(x)  =  0  can  only  be  encountered  for  (qi  ^  O.q?  #  O.  qa  ^  0)  at 
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(possibly)  apogee-like  points  in  the  behavior  of  U  (U  instantanTOusly  Amishes 
but  these  points  cannot  be  equilibrium  states  because  (wi  #  O.wj  #  0,W3  ^6  0). 
Therefore,  we  are  guaranteed  that  U(x)  <  0  almost  everywhere  (thus,  we  have 
the  ideal  situation  that  the  largest  invariant  subspace  is  all  of  state  space].  *  We 
asymptotically  approach  the  origin  from  all  finite  initial  states  and,  therefore,  have 
global  asymptotic  stability. 

Path  2:  This  analysis  is  more  formal  and  procedural  (exactly  analogous  to 
Example  3.5),  we  simply  apply  Theorem -3.9.  First  notice  the  set  Z  where  U(x) 
vanishes  is  the  set  of  arbitrary  real  values  for  the  q’s  and  zero  values  for  the  w’s.  It 
can  be  verified  by  direct  differentiation  of  U  that,  for  general  motion 

^  =  -2f;kiurM,  and  ^  =  _2^ki(.^.*-h«M).  (3.31) 

i=l 

Upon  evaluation  of  these  derivatives  on  Z  where  angular  vdodty  vanishes 
(a;i,C4;2,c*;3)  =  (0,0,0),  from  the  closed  loop  equations  of  motion,  the  nonzero  accel¬ 
eration  components  are  u\  =  — ko(l  +  Qi  +  ql  + 


d^U 

dt2 


0,  and  = —^0(1 -hqi  +  ql  +  ^  x€Z 

i=i  * 


(3.32) 


Since  the  second  derivative  of  U  vanishes  everywhere  on  Z,  the  third  derivative  is 
negative-definite  everywhere  on  Z,  the  conditions  of  Theorem  3.9  are  fully  satisfied, 
and  we  again  conclude  that  the  nonlinear  control  law  of  Eq.  (3.28)  gives  us  globally 
asymptotically  stable  attitude  control. 

Since  we  have  shown  U  to  be  a  positive-definite,  decreasing  function  of  time 
along  all  trajectories,  and  since  it  vanishes  at  the  origin,  then  the  necessary  and 
sufficient  conditions  are  satisfied  for  global  Lyapunov  stability.  We  have  implicitly 
excluded  the  geometric  singularity  (qj  — ►  00)  associated  with  this  parameterization 
of  rotational  motion  as  ^  we  can  use  the  quaternion  or  Euler  parameter 

description  of  motion  and  avoid  all  geometric  singularities  as  well.  This  path  has 
been  successfully  pursued  in  [Oh  1991],  [Wie  1989]. 

The  nonlinear  feedback  control  law  of  Eq.  (3.28)  guarantees  stability  of  the 
nonlinear  closed-loop  system  under  the  assxunption  of  zero  model  errors.  In 
practice,  of  course,  guaranteed  stability  in  the  presence  of  zero  model  error  is  not 
a  sufficient  condition  to  guarantee  stability  of  the  actual  plant  having  arbitrary 
model  errors  and  disturbances.  On  the  other  hand,  rigorously  defining  a  region 
in  gain  space,  guaranteeing  global  stability  for  our  best  model  of  the  nonlinear 
system  is  an  important  step;  it  is  reasonable  to  restrict  the  optimization  of  gains 
to  this  stable  family  of  designs.  The  determination  of  the  peirticular  gain  values, 
selected  from  the  space  of  globally  stabilizing  gains,  is  usually  based  on  performance 
optimization  criteria  specified  in  consideration  of  the  disturbance  environment, 
sensitivity  to  model  errors,  desired  system  time  constants,  actuator  saturation,  and 
sensor/actuator  bandwidth  limitations. 
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Before  generalizing  the  methodology  to  consider  multibody  and  partial  differ¬ 
ential  equation  systems,  it  is  important  to  reflect  on  the  selection  of  the  Lyapunov 
function  previously  given.  Notice  that,  if  a  system  has  no  inherent  stiffness  with 
respect  to  rigid-body  displacement,  it  is  necessary  to  augment  the  open-loop  energy 
inteCTal  by  a  pseudopotential  energy  term  [such  as  ko(qi  +  q2  +  <13)  “  the  preceding 
example];  generally  speaking,  the  pseudoenergy  terra  should  be  defeed,  if  possible, 
such  that  the  resulting  candidate  Lyapunov  function  (U)  is  a  positive  definite  mea¬ 
sure  of  departure  motion  that  has  its  global  minimum  at  ihc  desired  target  siaie. 
Then  the  stUl-to-be^ietermined  controls  are  usually  selected  as  simply  ^  possible 
(from  an  implementation  point  of  view)  to  force  pervasive  dissipation  (U  <  0)  of 
the  modified  energy  (Lyapunov)  function  along  all  trajectories  of  the  closed-loop 
system,  and  thereby  guarantee  dosed-loop  stability. 

To  illustrate  the  relationship  between  the  choice  of  the  Lyapunov  function  and 
the  resulting  family  of  stabilizing  control  law,  let  us  consider  a  slight  variation 
on  (Tsiotras  1994]  the  above  developments.  In  Ueu  of  the  Lyapunov  functions  of 
Eq.  (3.26),  we  choose  a  logarithmic  measure  of  attitude  error 


U  =  i  (liw?  +  I2W2  +  I3W3)  +  ko  In  (1  +  q?  +  ql  +  ql)  (3-33) 

» 

Proceeding  analogously  to  the  above  developments,  it  is  easy  to  verify  that 

U  =  53 ^  (3.34) 

1=1 


so  that  we  can  see  that  the  following  linear  feedback  law  is  globally  stabilizing 

Ui  =  -koqi-kiu;i,  i  =  1,  2,  3  (3.35) 


Contrasting  the  two  stabilizing  control  laws  of  Eqs.  (3.35)  and  (3.28),  it  is  clear 
that  the  simpler  Unear  law  of  Eq.  (3.35)  is  Ukely  more  attractive  as  regards 
implementation,  the  nonUnear  feedback  of  Eq.  (3.28)  is  found,  in  some 

circumstances,  to  give  a  desirable  closed  loop  response. 

This  example  points  out  clearly  the  coupling  between  selection  of  the  **  error 
energy  measure”  and  the  resulting  globally  stabilizing  controUersj  the  situation  is 
quite  analogous  to  appUcations  of  optimal  control  theory,  wherein  there  is  coupling 
between  the  choice  of  the  performance  index  and  the  resulting  optimal  control  law. 
Although  the  above  insights  are  useful,  definitive  criteria  for  optunal  selection  of  the 
Lyapunov  function  do  not  exist.  However,  the  above  examples  suggest  an  attractive 
strategy  that  defines  the  ’main  part’  of  the  Lyapunov  function  with  relative  weights 
on  the  portions  of  total  mechanical  energy  associated  with  structural  subsystems 
[Junkins  1993],  and  use  of  the  work/energy  method  provides  a  very  efficient  bypass 
of  most  of  the  algebra  and  calculus  leading  to  the  power  equations,  analogous  to 
Eq.  (3.27),  for  each  particular  physical  system  [Oh  1991].  The  lack  of  uniquene^ 
of  the  Lyapunov  function  is  not  necessarily  a  disadvantage  in  practice  because  it 
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is  a  source  of  user  flexibility  providing  needed  control  design  freedom  qualitatively 
comparable  to  the  freedom  one  has  in  selecting  performance  indices  when  applying 
optimal  control  theory.  Indeed,  formulating  the  Lyapunov  function  as  a  weighted 
error  energy  to  be  dissipated  by  the  controUer  is  qualitatively  attractive  for  both 
linear  and  nonHnear  qrstems,  since  this  gives  intuitive  and  physical  meaning  to  the 
Lyapunov  function  and  the  corresponding  control  g^. 


3.5  COOPERATIVE  CONTROL  OF  MULTIBODY 
MANIPULATORS 


3.5.1  Mechanics 

Prior  to  addressing  the  first  of  two  studies  wherein  the  above  ideas  are  applied, 
consider  the  class  of  dynamical  systems  whose  behavior  is  governed  by  the  discrete 
coordinate  version  of  Lagrange’s  equations 


or,  in  matrix  form 


dtVflq/  dq 


(3.36) 

(3.37) 


where  the  Lagrangian  C,  is  defined  in  the  classical  form  C,  —  T  —  V.  Restrictions 
imposed  in  deriving  Eqs.  (3.37)  are  such  that  the  coordinates  q;  are  independent 
functions  of  time  only  and  that  the  potential  and  kinetic  energies  have  the  functional 
forms  T  =  T(q,q,t),  V  =  V(q),  and  the  nonconservative  virtual  work  has  the 
form  dWnc  =  Eili  QMi  =  Thus,  Eqs.  (3.37)  are  valid  for  nonlinear, 

nonconservative  systems  as  well  as  linear,  conservative  systems. 

A  modest  generalization  allows  Eqs.  (3.37)  to  be  applied  to  significant  classes 
of  redundant  coordinate  or  constrained  systems  (i.e.,  the  coordinates  q-,  are  not 
independent).  To  accommodate  kinematic  constraints  which  depend  on  the  qs  and 
their  time  derivatives,  Lagrange  multipliers  can  be  introduced  to  generate  additive 
generalized  construnt  forces  on  the  right-hand  side  of  Eqs.  (3.37)  [J unkins  1986]. 
In  particular,  for  m  Pfaffian  (linear  in  the  generalized  velocities)  constraints  of  the 
matrix  form 

Aq-f  a<j  =  0  (3.38) 


The  generalized  constraint  force  that  needs  to  be  added  to  the  right-hand  side 
of  Eqs.  (3.37)  is  the  vector  A'^A,  where  q  is  an  N  x  1  vector  containing  the 
generalized  coordinates,  A  =  A(q)  is  an  m  x  n  continuous,  differentiable  matrix 
function,  ao(q)  Is  a  smooth,  m  x  1  vector  function,  and  A  is  an  m  x  1  vector  of 
Lagrange  multipliers.  One  standard  solution  process  is  to  differentiate  the  kinematic 
constraint  of  Eqs.  (3.38)  to  obtain 


Aq-bAq  +  ao  =  0 


(3.39) 
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Equation  (3.39)  can  be  solved  simultaneously  with  Eqs.  (3.37)  for  q  and  A,  to 
determine  the  coordinate  accelerations  and  constraint  forces  as  a  function  of  the  qs 
and  their  time  derivatives.  Note  that  the  N  differential  equations  of  Eqs.  (3.37)  must 
be  solved  simultaneously  with  the  m  kinematic  constraint  differential  equations 
[Eqs.  (3.37)]  in  order  to  determine  the  N+m  unknowns  in  the  vectors  q  and  A(t). 
During  recent  years,  significant  methodlolgy  has  evolved  for  effecting  numerical 
solutions  for  differential/algebraic  systems  of  equations,  see  Ahmad  1991  and 
Krishnan  1992  for  discussion  of  the  recent  literature. 

For  a  significant  class  of  systems,  the  algebra  and  calculus  required  in  a 
straightforward  application  of  Lagrange’s  equations  can  be  dramatically  reduced. 
For  the  the  most  common  case  of  natural  systems  for  which  the  kinetic  energy  is  a 
symmetric  quadratic  form  in  the  generalized  coordinate  time  derivatives,  one  finds: 


T  = 


N  N 


i=l  i=l 


(3.40) 


Note  that  q  is  an  N  x  1  configuration  vector  of  generalized  coordinates.  It  is 
convenient  (and  important)  to  collect  the  mass  matrix  M  =  M(q)  before  the 
differentiations  implied  by  Lagrange’s  equations  are  carried  out;  this  simple  point 
seems  to  elude  many  individuals  when  symbolic  codes  are  written  to  automate 
derivation  of  equations  of  motion.  Including  the  possibility  of  Pfaffian  nonholonomic 
constraints,  the  equations  of  motion  follow  from  Eqs,  (3.37)  as  the  following  N+m 
system  of  differential  and  algebraic  equations: 

Mq  +  G  +  |^  =  Q  +  A'^A.  Aq  +  ao  =  0  (3.41) 

where  is  the  N  x  1  vector  gradient  of  the  potential  energy  function,  and 
G  =  G(q,  q)  is  the  N  x  1  vector: 


G  =  rc<»4  -  4^0^41^.  =  +  (3.42, 

and  where  the  last  equation  that  generates  the  typical  element  cj^^  of  the  NxN 
symmetric  matrix  =  C(’)(q)  is  the  Christ ojffc/ operator. 

It  is  apparent  that  deriving  the  equations  of  motion,  for  natural  systems  subject 
to  PfafSan  nonholonomic  constraints,  has  been  reduced  to  formation  of  the  kinetic 
energy  to  identify  the  mass  matrix,  then  carrying  out  the  indicated  gradient 
operations  in  Eqs.  (3.42),  (3.43)  on  the  mass  matrix  elements  mjk  and  the  potential 
energy  to  form  the  vectors  G  =  G(q,  q)  and  dY/dq. 

For  the  case  that  the  nonconservative  forces  are  generated  by  an  nxe  x  1  vector 
u  of  control  inputs,  we  have  Q  =  Bu  and  Eqs.  (3.41)  assume  the  following  form 

M(q)q  +  +  G(q,q)  =  J5u  +  A(q)’*A 

A(q)q  +  ao(q)  =0 
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In  order  to  appreciate  some  of  the  issues  of  cooperation  associated  with  control 
design  for  redundantly  actuated  systems,  we  consider  a  specific  example  in  the 
following  discussion. 

3.5.2  A  Prototype  Cooperative  Control  Example 

Equations  of  Motion 

Consider  the  p^  of  robot  arms  shown  in  figure  3.1.  We  assume  four  active  joints; 
namely,  the  shoulder  and  elbow  joints  on  the  left  and  right  robots,  for  simplicity; 
the  wrist  torques  are  neglected.  The  objective  is  to  design  a  feedback  controller  to 
command  the  four  torques  so  as  to  stabilize  the  payload  with  respect  to  a  prescribed 
trajectory  of  the  payload  moving  from  an  arbitrary  reachable  State  A  to  an  arbitrary 
reachable  State  B.  It  is  desired  that  the  control  law  have  the  following  attributes: 

1.  Accommodate  an  arbitrary  feasible  reference  trajectory. 

2.  Be  of  a  simple  feedforward/output  output  error  feedback  form. 

3.  Guarantee  global  asymptotic  stability,  including  nonlinear  kinematics. 

4.  Handoff  smoothly  between  large  trajectory-tracking  motion  and  terminal  error 
suppression,  without  gmn  scheduling. 

We  present  a  control  strategy  possessing  these  four  desirable  attributes. 

Under  the  assumption  that  each  manipulator  is  composed  of  two  rigid  links,  that 
the  payload  is  a  rigid  body,  and  that  the  entire  system  undergoes  only  planar  motion, 
but  retaining  all  nonlinear  kinematic  effects,  the  kinetic  energy  of  the  system  has 
the  natural  form 

T  =H'^lM(q)]q 

(3.44) 

=  IqL^'tMLCqL)]^  +  |qR[MR(qR)]qR  -b  Hp  [Mp(qp)]qp 

where  the  configuration  coordinate  vector  natiirally  partitions  into  left(L),  right (R), 
and  payload(P)  configuration  coordinate  subsets  as 

..{f.), 

*1“  I  *1*^  62  I  06  h  '•  02  *c, 

The  7x7  system  mass  matrix  has  the  block  diagonal  structure 


Figure  3.1.  Cooperative  control  multibody  manipulator  experiment 
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where,  introducing  the  elbow  angles  'Oij  =  6j  —  6,,  the  substructure  mass  matrices 
are  compactly  written  as 

M  _\li  +  krnill  +  m2ll  ^mshhcose^ 

^  [  5^2/1 12^0*^12  ^2  +  4*^212  J 


(3.46) 


_  r  Is  +  ? ms/l  +  m^ll  ^m^hUcosees  1  xj 

[  ^TTLikUcOsOeS  U  +  j 


Mp  = 


(3.48) 


The  equations  of  motion  follow  in  the  form  of  Eq.  (3.43),  where,  using  Eq.  (3.42), 
the  nonlinear  vector  G(qi  <l)  has  the  following  specific  form 


G(q,«=|GR|. 


-m2^|/l/2S»”^12 


\  o"y  ^  \  -m^Uksine,,  \ 

(  miOlUhsinBes  J 

The  control  vector  (containing  the  four  shoulder  and  elbow  torques)  is 

U  =  {ui  li2  “6  “5}^ 


(3.49) 


(3.50) 


and,  using  the  virtual  work  principle,  we  can  establish  that  the  control  influence 
matrices  are 


Bl  O  j-j  _ii 

B=  O  Br  ,  Bl  =  Br=L  , 
O  O  J  L''  J 


(3.51) 


Upon  taking  the  origin  for  an  (x,y)  coordinate  system  as  the  base  hinge  point  of 
the  left  arm,  and  letting  the  x  axis  pass  through  the  base  binge  point  of  the  right 
arm,  the  geometric  constraints  arising  from  pinning  of  the  left  and  right  robot  wrists 
to  the  payload  at  points  Q  and  P  are  captured  by  the  four  holonomic  constraints: 


licosBi  +  I2COSB2  +  j/scos^s  —  Xjj  —  0 

lisinBi  +  l2sin82  +  \l3sinB3  —  ya  =0 

IsCOsBs  +  UcOsBs  —  ^l3COs63  — Xej  —  D  =0 
IssinBc  +  UsinBs  -  \l3sin83  —  Va  =0 


(3.52) 


Upon  differentiation  with  respect  to  time,  Eqs.  (3.52)  yield  a  kinematic  constr^nt 
of  the  Pffafian  form  (the  second  equation  of  Eqs.  (3.43)],  with  a©  =  0  and  with 


A(q)  = 


—lisinBi  l2sinB2 


_  licosBi 


I2COSB2 

0 

0 


0  0  —  5  Issm^a  —1  0 

0  0  i  I3COSB3  0  — 1 

—liSinBi  —UsinBs  |  /ssinffa  —1  0 

IscosBs  UcosBs  —  5 1300383  0  — 1 


(3.53) 
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and  also,  for  subsequent  use,  we  record  the  time  derivative  of  A  as 


’—Ii$ieos0i 

—llBisinSi 

0 

0 


A(q.q)  = 

^l202COs92  0  0 

^l202sin92  0  0 

•  • 

0  —IsSecosOg  —UBscosOs 

0  — —l^Sssinffs 


—jh^zcosBa 

—^tzBzsinds 

^IzBzcosBa 


0  0- 
0  0 
0  0 
0  0. 


(3.54) 


Now,  solving  the  first  of  Eqs.  (3.43)  and  Eq.  (3.39)  simultaneously  for  the 
generalized  constraint  force  Qc  =  A^A  and  Mq,  we  obt^ 


Qc  =  A'rA  =  Fi  +  F2U 
Fi  =  A'r(AM-iA'f)-i(G-Aq) 

Fa  =  -A'*'(AM-^AT)-^AM-^B 

and 

Mq+  G  =  Bu 

G  =  G  -  A‘r(AM-»AT)~‘  |aM-iG  -  Aq} 
B  =  [l  -  A'r(AM-^AT)"^AM-^]  B 
It  is  natural  to  introduce  the  consistent  partitions 


’Ml 

.  f  90  . 

Bl  ’ 

M  = 

Mr 

,  G  =  {  Gr  }.  B  = 

Br 

Mp, 

1  Gp  J 

.Bp  , 

and  rewrite  the  first  of  Eqs  (3.56)  as  three  equations 


(3.55) 


(3.56) 


(3.57) 


Ml^  +  9^(q,q)  =  BL(q,q)u 

Mr^  +  GR(q,q)  =  BR(q,q)u  (3.58) 

Mp^  +  Gp  (q,^  =  Bp(q,q)u 

This  constraint-free  form  of  the  equations  of  motion  implicitly  reflects  the  con¬ 
straints;  the  third  of  Eqs.  (3.58)  is  suffident  to  describe  the  dynamics  of  the  system, 
since  all  other  coordinates  can  be  determined  as  a  function  of  (qp,  qp)  through  use 
of  the  constraint  equations. 

Prior  to  discussion  of  control  law  design  approaches,  it  is  useful  to  consider  the 
inverse  kinematics  problem:  Given  a  smooth  desired  (prescribed)  payload  motion 
qp(t),  determine  feasible/desirable  corresponding  control  inputs.  Inverse  kinematics 
for  the  case  of  redundant  coordinates  involves  some  subtle  issues  which  are  captured 
in  the  follomng  sections. 


Inverse  Kinematics 

Notice  that  the  four  holonomic  constraints  of  Eqs.  (3.52)  reduce  the  number  of 
degrees  of  freedom  from  seven  to  three.  Thus,  in  principle,  we  could  derive  aU  co¬ 
ordinates  and  their  time  derivatives  history  from  a  given  trajectory  of  the  payload 


9«7 
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coordinates  qp(t)  =  [03(0  VcM)  •  Obviously,  if  we  know  all  of  the  coordi¬ 

nates  and  their  first  two  time  derivatives,  then  the  differtnUal  equations  of  inotion 
[Eqs.  (3.56)  or  (3.58)]  can  be  considered  algebraic  equations  for  determination  of 
the  corresponding  control  torques.  Since  there  are  only  three  degrees  of  freedom 
and  four  control  torques,  there  is  obviously  an  issue  of  uniqueness,  ^d  it  is  through 
the  expolitation  of  the  lack  of  uniqueness  that  we  can  seek  an  optimal  control  by 
which  the  robot  arms  may  cooperate  in  carrying  out  the  controlled  maneuver.  It  is 
also  important  to  anticipate  geometric  singularities  on  the  boundary  of  the  reach¬ 
able  region  (the  maximum  feasible  workspace).  First  let  us  consider  some  geometric 
issues. 

With  reference  to  Figure  3.1,  observe  that  a  given  motion  qp(t)  of  the  payload 
dictates  the  motion  of  points  P  and  Q  though  the  four  geometric  formulas: 


*<3  =  —  (^) 

VQ  =  Vci  -  (f) 

XP  =  *es  +  (  2  ) 

yp  =  y«s  +  (^)  **’”^3 


(3.59) 


and  obviously,  the  companion  equations  can  be  obtained  to  determine  the  first  two 
time  derivatives  of  the  grapple  point  coordinates  (xp,yp,X(j,yQ)  as  a  function  of 
the  payload  motion 


(®3| 


These  straightforward  equations  are  not  recorded  for  the  sake  of  brevity.  However, 
given  the  payload  motion,  we  can  obviously  determine  the  grapple  point’s  velocity 
and  acceleration  coordinates 


(ip  tVP  lyg  p  tVP  ^vq) 


by  differentiation  of  Eqs.  (3.59).  We  consider  how  to  determine  the  motion  of  the 
left  and  right  robot  arms.  Considering  the  geometry  of  the  left  robot  arm,  from 
Figure  3.1,  it  is  evident  that  the  left  shoulder  and  elbow  angles  6i  and  62  are  related 
to  the  instantaneous  position  of  the  grapple  point  (xq,|/q)  by 


Ox 

PlL 

Ihl 

^2 


hi  +  P21 

an-^yq/xQ) 

,  two,  roots,  take  >  0 

XQ^^llCOS$i  j 


COS 


.-1 


(3.60) 


Similarily,  considering  the  right  robot,  it  is  evident  that  the  right  robot  angles  6s 
and  ^5  are  related  to  (xp^yp)  by 
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^6 

PlK 

filR 

^5 


=  PlK  —  02R 
=  ian~^  (yi>/®p) 


)■ 


two  roots,  take  )82it  >  0 


(3.61) 


It  can  be  verified  taking  Ptl  and  An  positive  corresponds  to  the  “elbows  out” 
configuration  shown  in  Figure  3.1.  Obviously,  the  “elbows  in”  configuration  results 
from  dioosing  the  negative  signs  for  02L  and  02R,  and  two  other  asymmetric  con¬ 
figurations  are  possible  if  opposite  signs  are  selected.  The  lack  of  uniqueness  is  a 
consequence  of  redundancy  and  the  choice  of  control  modes  is  dictated  by  practi¬ 
cal  configurations.  Except  near  certain  singular  configmations  discussed  below,  it  is 
possible  to  manipulate  smoothly  through  an  infinite  family  of  neighboring  configura¬ 
tions  for  any  one  of  the  four  choices  on  signs  for  fti(t)  and  P2r{Y)-  Straightforward 
differentiation  yields  the  following  kinematic  equations  which  determine  the  first 
two  time  derivatives  of  the  left  and  right  shoulder  and  elbow  angles: 


{l}= 

{!:}= 


Ar' 


AR' 


}-*■■{  Ul 


where  we  have  introduced  the  matrices 


[^lisinOi  ^l2sin02^  .  _ 

licosOx  l2Cos62  J  *  ^  ”  L  hcosOe  UcosO^  J 


(3.62) 


(3.63) 


It  is  easy  to  verify  that  the  above  matrices  are  singular  if  =  02^  and  0e  = 
respectively.  It  is  obvious  that  these  singularities  corresponded  to  the  left  and  right 
arms  being  fully  extended,  and  it  is  clear  that  these  boundaries  of  the  workspace  are 
to  be  avoided  [the  reachable  set  of  points  interior  to  the  workspace  must  be  taken 
into  account  in  the  trajectory  planning  for  the  payload,  leading  to  the  nominal 
trajectory  qp(t)  of  the  payload]. 


Cooperative  Actuation 

Given  the  inverse  Idnematic  solution  for  all  system  coordinates  and  time  derivatives, 
as  a  function  of  a  prescribed  payload  trajectory  qp(t),  the  cooresponding  control 
torque  vector  u(t)  is  not  unique,  for  the  case  of  more  actuators  and  degrees  of 
freedom.  In  our  particular  example,  since  we  have  four  actuators  and  three  degrees 
of  freedom,  we  expect  an  infinity  of  torque  vectors  for  the  nominal  maneuver.  As 
in  the  case  of  human  beings  jointly  manipulating  a  heavy  object,  we  desire  to 


289 


exploit  the  redundancy  of  actuation  to  cooperate  in  the  sense  that  large,  nonworkmg 
““to 

cooperation  criterion  to  be  minimized 


J  =  iu‘’^WuU+  |Qc^ 


WcQc 


(3.64) 


subject  to  satisfying  the  tHrd  of  Eqs.  (3.58).  Notice  that  the  weight  matrixsel^tmn 
pewits  us  the  fl^biUty  of  emphasizing  small  torques  (u),  or  sm^ 

?0  or  a  compromise  between  these  two  competmg  objectives.  Using  the 

Lw«.S«  mi«pU»rol.,.wo  tatioduc  fto  m  x  1  Up^g.  multipliot  vector  7  a»d 
the^augmented  function  J.  and  use  Eqs.  (3.55),  (3.58)  to  write 


J  =  iu'^WuU  +  -(Fi  +  F2u)'^We(Fi  +  Fju)  +  7^  (Mp V  +  Gp  -  Bpu)  (3.65) 
2  ^ 

Requiring  that  the  gradients  Vu 3  and  V., J  both  vanish  as  a  necessary  condition 
for  minimizing  J  leads  to  the  solution 

U  =  h{Bp7-F?WcFi} 

ry  =  ^BpHBp)  IMpv  +  Gp  +  BpHFjWcFi^  (3.66) 

•  H  =  (Wu  +  FjWcFa)"^ 

Some  simple  calculations  with  example  payload  motions  reveal  the  utility  of  this 
formulation  of  the  inverse  kinematics  and  cooperative  actuation  strategy. 


An  Example  Nomina!  Payload  Trajectory 

Perhaps  the  simplest  and  easiest-to-motivatc  scheme  for  prescribing  a 
motion  qp(t)  for  the  payload  is  to  adopt  a  smooth  polynomial  spline 
initial  state  qp(to)  to  the  target  final  state  qp(t/)  of  the  form 


nominal 
from  the 


qp(t)  s=  /(r){qp(t/)  -  qp(to)}  +  qp(to)i 
qp(0  =  /(r){qp(t/)  -  qp(to)}.  />)  = 
^(t)  =  /(r){qp(t/)  -  qp(to)},  /(»■)  = 


where  we  choose  the  particular  shape  function 


/(t)  =T^(10-15r  +  6r*) 

|f.  =  t’(30  -  60r  +  30t*)  (3.68) 

^  =  T(60-180r  +  120r’) 

This  trajectory  can  be  shown  to  be  optimal  for  the  idealized  case  where  we 
consider  only  the  payload  trajectory  and  the  vector  sums  (F,  M)of  the  forces 
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and  moments  appUed  to  the  payload,  without  regard  to  how  these  are  generated; 
Eqs.  (3.67),  (3.68)  can  be  shown  [Junkins  and  Turner  1986]  to  simultaneously 
minimize  the  translational  and  rotational  jerk  iniegrals 

3i=  f  F^Fdt,  and  32=  f  M’^MA 
Aq  •'<0 

subject  to  satisfaction  of  the  third  of  Eqs;  (3.58),  and  the  boundary  conditions: 

qp(<o)  =  specif  ied  initial  position 
qp(to)  =  0 

®  .  (3.69) 

qp  (<y)  =  specified  final  position 

qp  (t/)  =  0 

^  (t/)  =  0 

Since  the  idealized  optimal  trajectory  [Eqs.  (3.67),  (3.68)]  does  not  explicitly 
consider  workspace  constraints,  this  nominal  motion  must  be  checked  to  make  sure 
it  remains  feasible  throughout  the  motion,  and  of  course,  optimality  with  respect 
to  the  entire  systems  dynamics  and  minimization  of  other  performance  measures 
cannot  be  clamed.  These  smooth,  easy-to-compute,  motions  usually  represent 
excellent  starting  solutions,  however,  and  we  elect  to  use  this  family  of  solutions 
to  generate  the  nominal  trajectories  throughout  the  rem^der  of  this  chapter.  A 
typical  example  motion  of  the  system  b  shown  in  Figure  3.2. 

A  Lyapunov  Stable  Tracking  Control  Law 

A  smooth  nominal  (reference)  trajectory  for  the  entire  system  can  be  computed 
using  Eqs.  (3.67),  (3.68),  and  via  inverse  kinematics,  the  left  and  right  robot  joint 
coordinates  are  determined  from  Eqs.  (3.59)~(3.62),  while  the  nominal  (cooperative) 
shoulder  and  elbow  torques  are  determined  from  Eqs.  (3.66).  Thb  is  a  for- 
example  way  to  determine  the  reference  trajectory,  and  can  be  replaced  by  a  more 
appropriate  path-planning  method  in  particular  applications.  However  the  reference 
trajectory  satisfying  the  boundary  conditions  of  Eqs.  (3.69)  b  determined,  we  denote 
all  state  and  control  variables  along  the  reference  trajectory  with  a  subscript  ref. 
Of  course,  in  actual  applications,  we  can  expect  that  the  system  will  not  follow  the 
reference  trajectory  qref(0  exactly  when  we  command  the  control  Urtf(t),  due  to 
model  errors,  external  dbturbances,  and  nonideal  actuation.  We  seek  a  perturbation 
6n  =  function(6q(t),  5q(t))  which  will  guarantee  that  an  intitally  dbturbed  motion 
will  asymptotically  return  to  the  reference  trajectory  in  the  absence  of  model  or 
implementation  errors.  Actually,  it  b  preferable  that  the  control  perturbation  iu 
b  in  output  feedback  form  where  it  depends  only  upon  a  measurable  subset  of  the 
coordinates  and  their  time  derivatives. 

In  view  of  the  four  kinematic  constraints,  we  know  that  a  minimal  coor¬ 
dinate  description  requires  only  three  generalized  coordinates.  By  considering 
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Figure  3.2.  Noziunal  maneuver,  payload  rotation,  and  actuator  torque  trajectories 


(q,q)to  be  functions  of  (qp,qp),  in  the  third  of  Eqs.  (3.58),  we  are  motivated  to 
investigate  the  kinetic  energy 


Tp  =  -qpMpqp 

(3.70) 

and  observe  that 

fp  =  q?Bpu 

(3.71) 

This  motivates  the  Lyapunov  function 

U=|5q?Mp«qp  +  |5q?Kidqp 

(3.72) 

where  5qp  =  qp  —  <lPref(0*  simplest  case  that  qPref(0  = 

is  easy  to  verify  that  the  Lyapunov  function  derivative  is 

constant,  then  it 

U  =  6qp  JBpu  +  Ki5qpj 

(3.73) 

140 


Stability  and  Control  of  Nonlinear  Mtchanical  Syatenns  Ch.  3 


and  selecting  the  bracketed  term  to  equal  -Kjiqp  (so  that  U  is  never  positive),  we 
are  led  to  the  global  stability  condition 

Bpu  =  -[KiSqp  +  K2«4p]  (3  J4) 

Since  Bp  is  a  3  x  4  matrix,  it  is  evident  that  u  is  underdetermined  and  we  are 
free  to  introduce  an  optimization  criterion  to  select  a  particular  control  satisfying 
Eq.  (3.74),  One  attractive  possibility  is  to  mmimize  this  gives  the  minimum 
actuator  torque  controller 

u  =  -Bj  (BpBJ)"  Vi«qp  +  Kj«qp]  (3.75) 

For  the  trajectory  tracking  case,  in  which  we  desire  to  stabilize  the  motion 
with  respect  to  a  prescribed  reference  motion,  the  atuation  is  more  complicated. 
Suppose  that  the  reference  trajectory  qpr«f(<)  and  an  associated  control  Uref(<)  are 
determined  consistent  with  the  system  dynamics  [for  example,  using  Eqs.  (3.59)- 
(3.69)].  Then  it  follows  that  the  payload  dynamics  at  every  instant  on  the  actual 
and  reference  trajectories  satisfy 

Mpqp  +  Gp  =  Bpu  _ 

Mp„,qp„,  +  Gp 

v«f 

and  it  also  follows  that  the  Lyapunov  function  [Eq.  (3.72)]  has  the  time  derivative 

U  =  SqJ  [Bpu  -  Bp,.,u„,  +  Ki^qp  -  SGp  -  «Mp„,^,.,  +  Mp5qp]  (3.77) 

Setting  the  bracketed  term  to  — K25qp  gives  the  stabilizing  control  condition 

Bpu  =  —  [Ki^qp  +  K25qp]  +  [^Gp  +  ^Mp„,qp,,,  —  Mpiqpj  (3.78) 

and  for  the  case  of  minimum  control  torque,  a  particular  solution  of  Eq.  (3.78)  gives 
the  nonlinear  feedback  law 

u  =  BJ(BpBJ)"'  {Bp„,ii..,  -  [Kifqp  +  K25qp] 

+  |5Gp  +  ^Mp..,  —  Mp5qpj  ) 

This  law,  while  guaranteeing  stability  (neglecting  model  errors),  is  cumbersome 
to  implement  due  to  the  det^led  computation  required  to  produce  all  of  the 
nonlinear  terms.  Note  that  the  payload  coordinates  qp  =  [03  Xc3  may  not 

be  directly  measurable.  For  example,  assume  that  the  measurable  quantities  are 
qL  =  (01  02]^  and  qa  =  [06  0s]'^,  and  the  time  derivaties  thereof;  then  it  is  easy 
to  verify  from  geometry  that  the  payload  coordinates  are  computable  as  follows 

fYazJZpl  ~  r  (ls*in94+l4sin9sy-{hMin$i^hsin$:i)  ] 

~  2  (®Q  +  xp)  =  h[{D  +  IsCOSds  -f  Ucosds)  +  (/iCOS0i  +  /2COS02)]  (3*80) 

Vc^  =  2  (I'Q  +  2/p)  =  +  U$in6^)  +  (/isxn0i  +  /2s:n02)] 


Stc  3-5«  Coop^f^tivc 
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„d  th«  time  derirative  w  =  Pe  *<.  »' 

^  iU  an  dternative  to  the  above  developments,  and  to  obtain  a  direct  output  error 
feedback  form  for  the  control  law,  we  can  observe  the  following  kmenatic  form  for 
the  work  rate  of  the  control  torques 

t  =  uih  +  «2^2  +  ueffe  +  “5^5  =  (  ^  }  "  = 

and  it  is  obvious  by  inspection  that  setting  ul  =  -KslOl.  ur  =  -Ksaqa  will 
decree  T  for  all  nonzero  motion  of  the  system.  This  energy  d^ipative  control 
suggests  the  following  output  error  feedback  law  for  controlhng  the  departure  motion 
relative  to  the  reference  trajectory 

=  (3.8.) 

^  •  V  _  [KiL  0  ■ 

where  the  4  x  4  positive  definite  g^n  matrices  have  the  structure  Ki  -  q  Kjr  . 

It  can  be  verified  that  the  control  law  of  Eq.  (3.82)  is  guaranteed  to  be  global . 
stabilizing  only  for  the  case  that  qref  =  constant.  While  global  asymptitic  stability 
is  not  guaranteed  during  the  time  interval  {to  <  <  <  ^  guaranteed  during 

the  interval  {t  >  t/},  for  all  reference  maneuvers  satisfymg  the  boundary  conditions 
of  Eq.  (3.69).  These  developments  can  be  better  appreciated  in  the  light  of  some 
illustrative  numerical  examples,  as  provided  in  the  next  section. 
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Cooperative  Control:  A  Numerical  Example 

To  illustrate  the  above  discussion,  we  consider  each  link  of  the  robots  to  be  1  m 
long  and  to  have  a  mass  of  1  kg.  The  distance  D  between  the  shoulder  joints  is 
taken  as  0.75  m,  and  the  nominal  initial  and  desired  target  values  of  five  angles  are 
listed  in  Table  3.1.  The  inverse  kinematic  process  of  Eqs.  (3.59)-(3.69)  was  used 
to  compute  the  solution  shown  in  Figure  3.2.  All  the  intial  conditions  were  then 
perturbed  by  moderately  large  angles  (order  of  10°),  and  the  feedback  control  law 

of  Eq.  (3.82)  was  used.  ,  . 

A  typical  controlled  response  from  large  inital  disturbances  is  shown  m  Fig¬ 
ure  3.3.  Notice  that  the  order  of  10“  initial  errors  are  less  than  0.5“  by  the  nominal 
final  time  of  10  s;  however,  a  few  more  seconds  of  terminal  control  are  required  to  ef¬ 
fectively  null  the  errors.  The  weight  matrices  [in  Eq.  (3.64)]  were  W„  =  I,  We  =  0, 
and  the  control  gains  [in  Eq.  (3.82)]  were  Kx  =  0.51,  Kj  =  0.21;  these  affect  the 
controlled  response,  however  we  found  a  large  family  of  feasible  valu«.  Froin  eval¬ 
uating  the  response  using  several  other  intitial  conditions  and  variation  in  the 
selections  of  the  control  gains  and  weight  matrices,  we  confirmed  that  a  wide  r^ge 
of  choices  give  excellent  tracking  stability  over  a  large  domain  of  intital  conditon 
errors.  Thus  the  control  law  of  Eq.  (3.82)  seems  to  be  an  attractive  candidate  for 
practical  applications. 
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Table  3-1-  Initial  and  final  angles  for  the  nominal  maneuver 

ei[dag]  Ojldcg]  Ozldeg]  O^ldcg]  es[deg]  time[s] 

initial:  121.0430  40.0323  00.0000  58.9570  139.9677  0 

target:  137.2041  -10.3342  90.0000  117.3017  142.3095  10 

The  above  results  have  been  extended  to  more  general  multilink  configurations, 
including  base  motion,  and  they  have  beeen  successfully  validated  in  an  experimen¬ 
tal  study  [Yale  1993],  including  consideration  of  the  case  of  the  robot  arms  mounted 
on  a  movable  base. 


Figure  3.3.  Controlled  response  &om  disturbed  initial  conditions 
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3.6  DYNAMICS.  STABILITY.  AND  CONTROL 
OF  A  DISTRIBUTED  PARAMETER  SYSTEM 

In  Figure  3.4  we  consider  control  of  a  rigid  hub  with  four  cantilevered  flexible 
appendages.  We  consider  the  appendages  to  be  identical  uniform  flexible  beams 
and  make  the  Euler-Bernoulli  assumptions  of  negligible  shear  deformation  and 
distributed  rotary  inertia.  Each  beam  is  cantilevered  rigidly  to  the  hub  and  has 
a  flnite  tip  mass.  Motion  is  restricted  to  the  horizontal  plane  and,  control  torque 
u(t)  acting  on  the  hub  is  the  only  external  effect  considered. 

We  are  interested  in  a  class  of  rest-to-rest  maneuvers,  and  under  the  previously 
mentioned  assumptions,  we  can  show  that  the  beams  will  deform  in  the  antisym¬ 
metric  fashion  (Figure  3.4),  with  the  configuration’s  instantaneous  mass  center  re¬ 
maining  at  the  hub’s  geometric  center.  Also,  because  of  the  assumed  antisymmetric 
deformation  of  the  beams,  in  this  section  we  need  to  concern  ourselves  only  with 
the  deformation  y(x,  t)  of  a  single  beam.  We  subsequently  relax  this  restriction,  to 
permit  more  general  kinematic  assumptions  and  the  analysis  that  flows  form  it.  We 
adopt  the  continuum  viewpoint  and  avoid  introducing  spatial  approximations  in  the 
application  of  Lyapunov  stability  concepts;  the  resulting  control  law  and  stability 
arguments  will  therefore  apply  rigorously  to  the  distributed  parameter  system.  The 
hybrid  system  of  ordinary  and  partial  differential  equations  governing  the  dynamics 
of  this  system  is  readily  obtained  from  Hamilton’s  principle  to  be  [Junkins  1993] 


Ihub  =  u  4*  4(Mo  —  SoLo) 

-(Mo-SoLo)  =  +  +  hot 

+  +  EI0  =  O+HOT 

(3.83) 

where 


P 

El 

(Mo.  So) 

e 


mt 

(L.Lo) 


assumed  constant  mass/unit  length  of  the  beams 
assumed  constant  bending  stiffness  of  the  beams 
bending  moment  and  shear  force  at  the  root  of  the  beam 
hub  inertial  rotation 
mass  of  the  tip  mass 

distance  from  the  hub  center  to  the  beam  tip  and  the  hub  radius 


In  Eq.  (3.83),  we  denote  higher-order  terms  by  HOT  to  indicate  other  known  linear 
and  nonlinear  effects  (such  as  rotational  stiffening,  and  shear  deformation).  The 
most  fundamental  developments  do  not  consider  these  higher-order  effects;  however, 
we  selectively  discuss  the  generalizations  that  accommodate  these  effects  as  well.  Of 
course,  in  general,  there  are  unknown  model  errors  and  disturbances  as  well,  and  a 
practical  control  scheme  must  be  stable  in  the  presence  of  reasonable  model  errors. 
The  boundary  conditions  on  Eqs.  (3.83)  are: 


at  x  =  Lo:  y(t.Lo)  =  |^l  =0  (clamped  beam  geometric  B.C.s) 

ILo 

at  x.=  L:  =  °  (moment)  (3.84) 

The  total  energy  of  the  system  (constant  in  the  absence  of  control  or  distur¬ 
bances)  is 


l  =  Ihub(^) 


(3.85) 


di  .  «i 

At  ^  flt 
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Motivated  by  results  published  in  the  recent  literature  (Refs.  3-5,19,21,22), 
we  investigate  the  following  weighted  energy  function  as  a  candidate  Lyapunov 
function: 


+4a3 


rL 

JLo 


(3.86) 


V 


where  the  positive  weighting  coefficients  a;  are  included  to  allow  relative  emphasis 
on  the  three  contributors  to  the  ^error  energy”  of  the  system.  Note  that  this 
is  one  of  many  possible  ways  to  weight  the  mechanical  system  error  energy,  and 
merely  provides  one  illustration  of  an  approach.  It  is  physically  reasonable  to 
consider  placing  relative  emphasis  upon  dissipating  subsets  of  mechanical  energy 
as  a  control  strategy,  because  some  energy  subsets  are  obviously  more  degrading  of 
system  performance  objectives  than  others  in  practical  applications.  Since  6  does 
not  appear  in  the  total  mechanical  energy  of  Eq.  (3.85),  the  total  energy  of  Eq.  (3.85) 
is  only  positive  semidefinite.  We  have  added  the  positive  “torsional  spring  energy” 
term  a2{6  —  BjY  in  Eq.(3.86)  as  a  pseudoenergy  to  make  the  target  final  state 


be  the  global  minimum  of  U.  It  is  obvious  by  inspection  that  imposing  a;  >  0  in 
Eq.  (3.86)  guarantees  that  U  >  0  and  that  indeed  the  global  minimum  of  U  =  0 
occurs  only  at  the  desired  state  (we  wish  to  begin  at  rest  and  rotate  to  a  new 
angular  position  suppressing  vibration  enroute  and  returning  to  zero  flexural 

deformation  in  the  final  position).  Differentiation  of  Eq.  (3.86),  substitution  of  the 
equations  of  motion  [Eqs.  (3.83)  and  (3.84)],  and  considerable  calculus  leads  to  the 
weighted  power 

dU  •  r  1 

U  =  =  Q  ^  1x2(5  —  5/)  +  4(03  —  ax)(LoSo  —  Mo)J  (3.87) 

Since  we  require  that  U  <  0  to  guarantee  stability,  we  set  the  term  in  brackets  to 
—045,  and  this  leads  to  the  control  law 

'i  =  1^2(5  —  5y)  +  04!?  +  4(03  —  cii)(LoSo  ““  Mo)j  (3.88) 

In  [Oh  1992],  we  developed  a  shortcut  based  upon  the  work/energy  rate  method 
that  avoids  most  of  the  algebra  and  calculus  required  to  establish  the  weighted 
power  expressions  like  Eq.  (3.87),  we  could  make  use  of  this  idea  here  to  arrive 
more  efficiently  at  Eq.  (3.87). 

From  Eqs,  (3.87)  and  (3.88),  and  considering  all  possible  values  for  the  a*,  we  see 
that  the  following  linear,  spatially  discrete  output  feedback  law  globally  stabilizes 
this  distributed-parameter  system: 
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U  =  -  [ffi  (5  -  0/)  +  92^  +  j3(LoSo  -  Mo)] ;  (Z.S9) 

9i  >0,  92  >  0.  93  >  -4  for  stability 

This  control  law  is  elegant.  Notice  that  the  rigorous  stabUity  proof  does  not  depend 
on  introducing  spatial  discretization  methods  such  as  the  fimte  element  method. 
Furthermore,  we  have  verified  from  root  locus  calculations  that  the  g^  stabdxty 
boundaries  are  apparently  exact  in  this  case  (to  10  digits  for  the  first  10  eig^v^ues) 
Of  important  practical  consequence,  notice  that  controllers  based  on  tl^  law  of 
Eq  (3  89)  are  easy  to  implement  since  no  state  estimation  is  required.  The  root 
shear  and  bending  moment  can  be  measured  by  using  conventional  steam  gaug«. 
The  value  and  sign  of  the  shear/moment  feedback  gain  53  =  4(a3  -  ai)/ai  depends 
on  whether  we  wish  to  emphasize  dissipation  of  the  beam  vibration  ener^  (for 
03  >  ai)  or  the  energy  of  hub  motion  (for  03  <  ai),  as  is  evident  from  Eq.  (3.86). 

Since  U  =  is  not  an  explicit,  negative  definite  function  of  the  subset  of 

state  variables  o  / 

the  stability  arguments  implicitly  depend  on  the  truth  ihai  all  infini^  of  the 
antisymmetric  modes  of  motion  of  this  structure,  have  generally  nonzero  hub 
velocity  (6).  Note  under  the  kinematic  assumptions  leading  to  Eqs.  (3.83), 
only  antisymmetric  modes  are  present,  and  no  nontrivial  motion  can  exist  while 
the  hub  angular  velocity  vanishes  identically  for  finite  time  intervals.  A  more 
elegant  proof  of  global  asymptotic  stability  using  the  feedback  law  of  Eq.  (3.89) 
can  be  done  by  applying  Theorem  3.9.  This  has  been  carried  to  completion 
in  [Mukherjee  1992],  including  consideration  of  the  cas«  in  which  we  relax  the 
antisymmetric  deformation  assumption  applied  in  deriving  Eqs.  (3.83),  thereby 
admitting  a  richer  and  more  general  set  of  motions  (the  four  beams  are  described 
by  four  distinct  functions  of  space  and  time,  and  there  are  now  four  PD&  and  one 
hybrid  differential/integral  equation).  For  this  more  general  configuration,  it 
be  shown  that  a  single  hub  actuator  cannot  provide  rigorous  asymptotic  stability, 
because  only  an  antisymmetric  subset  of  the  modes  are  controllable  by  a  hub 
actuator  (physically/qualitatively,  the  uncontrollable  modes  have  identical  adjacent 
beams  moving  in  opposition,  which  results  in  equal  and  opposite  root  moments  and, 
because  of  this  cancellation,  zero  hub  motion).  For  rest-to-rest  maneuvers,  however, 
only  the  antisymmetric  modes  considered  here  are  disturbable  (by  a  hub  torque 
actuator),  and  they  are  also  controllable.  Thus,  for  the  assumptions/constr^nts 
imposed  in  deriving  the  differential  equation  model  developed  above,  the  control 
law  of  Eq.  (3.89)  b  globally  stabilizing. 

It  b  significant  that  thb  same  linear  feedback  law  of  Eq.  (3.89)  maintains 
its  globaUy  stabilizing  character  even  when  the  Euler-BernouUi  assumptions  are 
relaxed  to  include  the  most  common  additional  linear  and  nonlinear  effects.  In 
particular,  we  have  verified  that  closed-loop  stability  b  maintained  when  we  include 
the  following:  rotational  stiffening,  Coriolb  kinematic  coupling  terms,  aerodynamic 
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drag,  shear  deformation,  beam  rotary  inertia,  and  finite  inertia  of  the  tip  ma^. 
The  verification  of  these  truths  requires  appropriate  modifications  of  the  kinetic 
and  potential  energy  functions  and,  of  course,  the  differential  equations  of  motion 
musfbe  generalued  consistently.  In  particular,  U  =  «n 

conditions  0  =  0,  0  =  0  can  be  encountered  at  some  pomt  other  than  U-0  (the 
Urget  state),  so  the  nonlinear  proof  proceeds  directly  from  the  closed-loop  system 
diffiential  ^nations  by  showing  that  the  condition  6  =  6  =  0  occurs  only  at  the 

desired  equilibrium: 

5y(x,t) 


In  shoH,  global  siabiliiy  of  the  system  using  the  simple  linear  output  feedback  control 
law  of  Eq.  (S.89)  has  been  found  to  be  very  forgiving  of  the  usual  vanatioM  in 
modeling  assumptions  and,  therefore,  modeling  errors.  In  t^  section,  an  mdirect 
method  of  Lyapunov  for  analyzing  the  motion  of  a  nonlinear  system 
equilibrium  state  has  been  presented,  and  also  a  method  for  generating  global  y 
stabilizing  feedback  control  law  for  distributed-parameter  structural  systems  has 
been  discussed  as  an  important  application  of  Lyapunov  direct  method. 

We  have  discussed  the  vibration  suppression  problem  of  the  hub-appendage 
configuration  in  the  previous  sections.  As  discussed  above,  the  constant  gam  Imear 
feedback  control  law  works  poorly  if  we  try  to  use  the  same  constant  gams  for 
both  large  angular  motions_and  for  small  terminal  motions.  This  is  becai^e  the 
large  gains  required  for  effective  vibration  suppression  and  disturbance  rejection 
to  accurately  isolate  the  target  state  are  typically  several  orders  of  magnitude  too 
large  for  the  en-route  portion  of  the  maneuver  (i.e.,  the  large  gams  appropriate  for 
vibration  suppression,  when  used  during  a  large-angle  maneuver,  typically  r«ult 
in  significant  6  overshoots  and,  often,  actuator  saturation).  Also,  the  large  initid 
torque  command  typically  introduces  a  large  vibratory  transient  into  highly  fl^ble 
structures.  From  a  qualitative  point  of  view,  if  we  wish  to  maneuver  a  highly 
flexible  structure  while  suppressing  vibration,  then  it  is  unlikely  that  we  should 
initiate  this  process  by  hitting  the  structure  with  a  large  hammer!  To  obtain  a 
control  law  more  appropriate  for  near-minimum-time  large-angle  maneuvers  with 
vibration  suppression,  stable  tracking-type  feedback  control  laws  discussed  in  this 

section  can  be  applied.  .  ,  j  w  i. 

Consider  briefly  the  near-minimum-time  maneuver  of  a  rigid  body.  We  know 
that  the  strict  minimum-time  control  is  a  bang-bang  law  which,  for  the  rest- 
to-rest  maneuver-to-the  ori^n  case,  saturates  negatively  during  the  first  half  of 
the  maneuver  and  positively  during  the  last  half  of  the  maneuver  [Junkins  1986, 
1991, 1993],  [Meirovitch  1987],  [Singh  1989],  [Breakwell  1981],  [Slotine  1991],  [Van- 
derVelde  1983].  From  an  implementation  point  of  view,  the  instantaneous  switches 
of  the  bang-bang  law  are  sometimes  troublesome  because  (1)  no  torque-generating 
device  exists  that  can  switch  instanteneously;  (2)  when  generdized  and  applied  to 
a  flexible  structure,  the  bang-bang  class  of  controls  excite  poorly  modeled 
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higher  modes;  and  (3)  the  switch  times  (and,  therefore,  the  dynamics  of  the  actual 
system)  are  usually  very  sensitive  to  modeling  errors. 

An  attractive  family  of  paramtUriztd  sharpness  approximations  of  the  switch 
function  has  been  introduced  to  modify  the  admissible  controls  in  near-minimum- 
time  control  formulations.  The  approximation  presented  in  [Thompson  1989]  and 
[Byers  1990],  involves  transcendental  functions,  but  recent  analytical/experimental 
work  [Junkins  1991, 1993]  indicates  that  a  much  simpler  piecewise  continuous  spline 
appro3dmation  of  the  switching  function  is  attractive  from  an  implementation  point 
of  view.  Using  this  approach,  a  typical  near-minimum-time  control  law  (for  single 
axis,  rest-to-rest  maneuver  of  a  rigid  body)  has  the  form 

W  =  u  =  ±u„„/(At,  tf ,  t)  (3.90) 


where  tf  is  the  maneuver  time  and  a  =  At/tf.  We  choose  the  (+)  sign  if  fif  >  6c- 
As  a  torque  shaping  function,  we  adopt  the  smooth  sign  function  approximation 
/(At,tf,t): 


/(At,tf,t) 


for  0  <  t  <  At 

\At/  1  V.At/j’ 

1. 

for  At  <  t  <  Y  -  At  =  ti 

forti  <t<  ^  +  At  =  t2 

-1, 

for  t2  <  t  <  ^  —  At  =  t3 

for  t3  <  t  <  tf 

Adopting  the  positive  sign,  Eq.  (3.90)  integrates  to  yield 

e{t)  =  flo  +  /(At,  tf ,  r)dr 

^(t)  =  ^0  +  (t  -  to)0o  +  /t‘  /,7  /(At,  tf ,  T2)dr2dri 


(3.91a) 

(3.91b) 


The  integrations  in  Eqs.  (3.91)  can  be  carried  out  in  terms  of  elementary 
functions,  which  are  not  presented  here  for  the  sake  of  brevity;  the  resets  of  these 
integrations  give  Eqs.  (3,93),  (3.94)  below.  Figure  3.5  shows  a  maneuver  resulting 
from  these  integrations  for  a  typical  selection  of  parameters  (a  =  0.25,  Umax  =  400 
oz-in.),  and  a  40®  rest-to-rest  maneuver  of  a  rigid  approximation  of  the  structure 
in  Figure  3.4  and  Table  3.2.  For  rest-to-rest  maneuvers,  we  impose  the  boundary 
conditions: 


at  to  =  0  :  6(0)  =  ^o,  ®(0)  =  0 

at  time  tf:  fl(tf)  =  fiy,  0(tf)  =  O 


(3.92) 


and  upon  carrying  out  the  integrations  implied  in  Eq.  (3.91),  we  obtain  the  useful 
relationship 
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Table  3.2.  Texas  AitM  maneuverable  flexible  structure;  configuration  parameters 


Total  undeforraed  system  inertia,  I 
Hub  radius,  Lq 
Hub  center  to  tip  mass,  L 
Tip  mass,  mt 

Appendage  modules  of  elasticity,  E 
Inertia  of  bending  section,  I 
Mass  density  of  appendage/length,  p 


2128,  oz-s -in. 
5.5470,  in. 

51.07,  in. 

0.15627.  oz-sVin. 
161.6x10®,  oz/va? 
0.000813,  in.® 
0.003007,  oz-s^/in.^ 
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=  Al  =  ««r,  0<«<i  (3.93) 

.  _  / _ I(g/  -  6o) _ 

\  u„ax[(l/4)  -  (l/2)a  +  (1/I0)a2] 

In  Eq.  (3.94),  we  see  the  explicit  tradeoff  between  torque  shaping  a,  target 
maneuver  time  tf,  maneuver  angle  Of  ^  6q,  and  maximum  angular  acceleration 
Umax/I-  Obviously,  Eq.  (3.93)  can  be  inverted  for  any  of  these  as  a  function  of 
the  remaining  parameters.  If  we  set  a  =  At/tf  =  0,  of  course,  we  obtdn  the 
well  known  special  case  result  expressing  the  relationship  between  the  minimum 
time,  maneuver  angle,  inertia,  and  saturation  torque  for  bang-bang  control-  It 
is  obvious  that  selection  of  a  controls  the  sharpness  of  the  switches,  vdth  a  =  0 
corresponding  to  bang-bang  control  (instantaneous  switches)  and  a  =  0.25  being 
the  smoothest  member  of  this  family  of  torque-shaped  maneuvers.  Figure  3.6  shows 
the  rigid  body  maneuver  time  tf  vs  a,  from  Eq.  (3.94),  whereas  Figure  3.7  shows  the 
residual  total  energy  (at  time  tf)  when  the  torque-command  Uref  =  Uniax/(oftf,tf,  t) 
is  applied  to  simulate  the  flexible  body  response  [first  six  modes  from  a  discrete 
assumed  mode  model  (Chapter  4  of  [Junkins  1993]  of  order  20).  Notice  (Figure 
3.7)  that  open-loop  torque  shaping  reduces  residual  vibration  at  time  tf  by  three 
orders  of  magnitude  (a  =  0.1)  with  only  a  modest  ten  percent  increase  over  the 
theoretical  minimum  time  rigid  body  maneuver  (or  =  0).  The  preceding  results 
and  [Junkins  1991, 1993],  [Thompson  1989],  [Vadali  1990],  and  [VanderVelde  1990], 
support  the  intuitively  obvious  truth  that  applying  judiciously  smoothed  bang-bang 
controls  such  as  Eq.  (3.90)  to  generate  an  open-loop  maneuver  of  a  flexible  body 
ran  result  in  nesLT  negligible  structural  vibration  for  sufficiently  slow  maneuvers 
(small  Umax  and  large  a)  and  neglecting  disturbance  torques.  Of  course,  unmodeled 
disturbances,  control  implementation  errors,  and  model  errors  can  be  expected  to 
negate  some  of  these  apparent  gains.  However,  sharper  control  switches  obviously 
increase  the  probability  that  higher  frequency,  less  well  modeled  modes  will  be 
excited  and,  therefore,  robustness  with  respect  to  model  errors  is  generally  more 
of  an  issue  for  bang-bang  control  than  for  smoother  torque  profiles.  Even  for 
relatively  small  departures  (slightly  smoothed  switches)  from  bang-bang  control, 
torque-shaped  maneuvers  of  highly  flexible  structures  typically  enjoy  a  reduction  of 
several  orders  of  magnitude  in  residual  vibration.  Thus,  the  overall  maneuver  time 
(including  terminal  vibration  suppression)  can  be  reduced  significantly  by  torque 
shaping. 

These  observations  suggest  the  following  strategy:  Use  an  optimized  shaped- 
input  profile  to  establish  a  “trackable”  a  priori  reference  rigid  (or  reduced-order 
flexible)  body  maneuver;  then,  based  on  real-time  measurements  of  the  actual  flexi¬ 
ble  body’s  departure  from  this  smooth  reference  motion,  superimpose  a  perturbation 
feedbadc  control  on  the  reference  shaped-torque  history  that  stabilizes  the  depar¬ 
ture  motion  from  the  reference  motion.  Also  of  significance,  it  is  usually  desirable 
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Figure  3.6.  lU^d  body  maneuver  time  vs  satiiration  toxx^ue  and  torque-shaped  parameter 
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Figure  3.7*  Flexible  body  open-loop  residual  vibration  energy  vs  saturation  torque  and  torque- 
shaped  parameter 
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to  select  the  reference  torque  profile  parameters  Umax»  o,  etc.)  to  consider  the 
available  sensor  and  actuator  dynamics  and  thereby  make  the  commanded  torque 
history  more  nearly  achievable  physically. 

Pursuing  this  logic  judiciously,  attractive  tracking-type  feedback  control  laws 
can  be  established  for  near-minimum-time,  large  angle  maneuvers.  Since  bang- 
bang  flexible  body  controllers  are  sensitive  to  modeling  and  control  implementation 
errors,  we  seek  control  laws  that  are  a  smooth  torque-shaped  compromise  between 
the  competing  objectives  of  minimizing:  (1)  maneuver  time,  (2)  residual  vibration, 
and  (3)  sensitivity  of  closed-loop  performance  measures  with  respect  to  model  and 
control  implementation  errors. 

We  adopt  a  reference  rigid  body  maneuver  {0r«fWi5r«f(t),^rcf(t)  = 
satisfying  Eqs.  (3.90)-(3.94),  where  I  is  the  undeformed  moment  of  inertia  of  the 
structure,  and  we  have  implicitly  selected  or,  Umax  computed  the  corresponding 
tf  from  Eq.  (3.94)  for  specified  initial  and  final  angles.  For  designing  a  globally 
stable  tracking  controller,  the  candidate  error  energy  Lyapimov  function  can  be 
established  by  considering  Eq.  (3.86)  as 

2U  =  ailhubW*  +  02^6^  +  403]  £  +  xW]  ’dx 

2  ^  2  "I  (3-95) 

+  dx  +  mt[LW  +  «|f|J  I 

where  5(  )  H  (  )  —  (  )r  and  the  (  )r  quantities  are  evaluated  along  the  open-loop 
flexible  body  solution  of  Eqs.  (3.83)  with  u(t)  =  Uref(t).  Considering  Eqs.  (3.87) 
and,  the  time  derivative  of  U  is  given  by 

U  =  (0  -  0r)|  fllU  -  CiUref  +  a2(0  -  ^r) 

^  ^  (3.96) 

+4(03  -  ai)((LoSo  -  Mo)  -  (LoSo  -  Mo)r]} 

Pursuing  the  objective  of  globally  stable  control,  it  is  clear  that  setting  the 
[  ]  term  equal  to  -04(5  —  0^)  leads  to  the  following  globally  stabilizing  [with 
U  =  —04(5  —  control  law: 

u  =  Urcf (t)  —  |gi(^  —  ^r)  +  Z20  —  ^r)  +  gsKLoSo  —  Mq)  ~  (LqSo  —  Mo)r]}  (3.97) 

To  enable  easy  implementations,  the  following  structure  for  a  tracking  control  law 
can  be  hypothesized: 

u  =  Upef(t)  — |gi(0  — 0n!f)+g2(5  — tfr^)+g3[(LoSo  “  Mq)  —  (LoSq  —  Mo)ref]  (3.98) 

where  it  is  easy  to  show  that  the  root  moment  for  the  special  case  of  a  reference 
(rigid  body)  motion  is  proportional  to  the  angular  acceleration: 

(LqSo  “  Mo)ref  ^  ”  ^o)/3  + 


(3.99) 
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Obviously,  the  globally  stabilizing  control  law  of  Eq.  (3.97)  is  similar  to  the 
conjectured  law  (for  practical  implementation)  of  Eq.  (3.98),  the  difference  being 
that  Eq.  (3.98)  requires  presolution  for  the  open-loop  rigid  body  (  )ref  quantities, 
whereas,  the  globally  stabilizing  control  law  of  Eq.  (3.97)  requires  solution"  for 
the  open-loop  flexible  body  (  ),  quantities  from  the  partial  differential  equations. 
Since  near-minimum-time  control  implies  a  certain  urgency(!),  it  is  obvious  that  the 
negli^ble  computational  overhead  of  EJq.  (3.98)  is  more  attractive  than  Eq.  (3.97) 
from  the  point  of  view  of  real-time  implementations.  For  the  purpose  of  finding  the 
region  possessing  Lyapunov  stability,  substitute  Eq.  (3.98)  into  Eq.  (3.96) 


ij  =  -0i(«  -  5r){g2(^  -  fir)  +  [gi Afi  -1-  g2A5  +  g3A(LoSo  -  Mq)]  }  (3.100) 

The  Lyapunov  stability  condition  comes  from  requiring  U  of  Eq.  (3.100)  to  be 
negative;  a  sufficient  condition  is 

\e-6r\>  — Igi  A5  +  g2 A?  +  g3 A(LoSo  -  Mo)|  (3.101) 

If  the  angular  velocity  tracking  error  \6  —  firl  exceeds  /x,  then  V  is  negative  and 
apparently  U  decreases  until  encountering  the  region  bounded  by  Eq.  (3.101).  It  is 
further  apparent  that  the  A  quantities  on  the  right  side  of  Eq.  (3.101)  are  finite  and 
(pre-)computable  differences  between  open-loop  flexible  (  )r  and  rigid  body  (  )ref 
motions.  Thus,  an  upper  bound  /x  can  be  established  directly  by  precomputation  of 
a  family  of  two  open-loop  motions  and  the  use  of  a  particular  set  of  feedback  gains. 
Equation  (3.101)  thus  determines  an  angular  velocity  variable  boundary  defining 
a  region  F  near  the  (  )ref  motion.  Note  that  large  motions  are  globally  attracted 
to  r  because  U  <  0  everywhere  outside  of  this  region.  Thus,  the  control  law 
of  Eq.  (3.98)  is  almost  globally  stabilizing,  and  the  only  region  where  asymptotic 
stability  is  not  guaranteed  is  the  small  F  boundary  layer  region  near  the  target 
trajectory.  Furthermore,  the  right  side  of  Eq.  (3.101)  is  essentially  a  measure  of 
how  nearly  the  reference  target  trajectory  satisfies  the  flexible  body  equations  of 
motion;  a  judicious  choice  of  the  shaping  parameters  defining  the  target  trajectory 
and  the  associated  reference  control  input  can  usually  be  made  to  result  in  /x  (and 
therefore  F)  being  sufficiently  small. 

A  bounded-input/bounded-output  (BIBO)  viewpoint  of  stability  can  be  used 
to  establish  some  insight  into  the  motion  in  the  F  region.  Departure  motion 
differential  equations  for  5(  )  =  (  )— (  )r  quantities  can  be  obtained  by  differencing 
Eiqs.  (3.83),  driven  by  the  control  law  of  Eq.  (3.98),  from  the  rigid  body  equations  of 
motion,  driven  by  u^f .  Upon  formulating  these  equations,  one  can  verify  that  the 
departure  motion  is  governed  by  a  linear,  otherwise  asymptotically  stable,  system 
of  differential  equations,  forced  by  the  known  A  terms  that  appear  in  Eq.  (3.101). 
The  6(  )  motion  in  the  F  region  is  thus  bounded  because  the  A  forcing  terms 
are  bounded;  the  finite  maxima  of  these  terms  can  be  found  by  direct  calculation. 
The  resulting  departure  motion  is  therefore  bounded  everywhere  in  the  F  region, 
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which  was  already  known  to  have  a  (typically  small)  finite  dimension  /x.  Since 
the  actual  numerical  bounds  on  the  A  and  quantities  can  be  made  arbitrarily 
small  (depending  on  how  nearly  the  user-defined  reference  trajectory  is  made  to 
satisfy  the  open-loop  equations  of  motion),  we  have  a  very  elegant  theoretical  and 
practical  situation  vis-a-vis  stability  of  the  closed-loop  tracking  motion.  We  see 
that  the  closed-loop  motion  is  globally  attracted  to  the  controUably  small  T  region 
near  the  target  trajectory  and,  considering  the  motions  within  F,  we  have  BIBO 
stability. 

In  this  application,  we  use  a  torque-shaped  rigid  body  reference  trajectory,  which 
is  very  attractive  since  the  reference  maneuver  can  be  calculated  in  closed  form  [such 
as  the  family  of  Eqs.  (3.90)-(3.96)]  and  since  the  ensiling  tracking  law  performs 
extremely  well.  Note  that  ^s.  (3.90)-(3.96)  have  a  continuous  transition  to  the 
final  fixed  state: 

■^^rcf(0>  ^rcf(t)i  ^pef(0»  I^^o(t)]refi  [So(t)]ref^  =  0|  0|  aS  t  — ♦  tf 

SO  that,  for  t  >  tf ,  only  the  three  feedback  terms  of  Eqs.  (3.98)  are  contributing 
to  the  terminal  fine-pointing/vibration  arrest  control.  Thus,  the  controls  blend 
continuously  from  the  large-angle  tracking  law  of  Eq.  (3.98)  into  a  constant  gain 
controller  (for  t  >  tf)  identical  to  the  globally  stable  fixed  point  output  feedback 
case  of  Eq.  (3.88).  Thus  we  have  unqualified  global  stability  for  t  >  tf . 

Simulated  Results  for  Large  Angle  Maneuvers 

Returning  to  the  family  of  40®  open-loop  maneuvers  used  to  generate  the  energy 
surface  of  Figure  3.7,  we  computed  the  velocity  tracking  bound  p  for  Lyapunov 
stability  [as  given  by  Eq.  (3.101)]  and  found  the  maximum  value  (pmax)  of  M(t) 
along  each  trajectory.  Figure  3.8  displays  this  worst-case  tracking  bound  (maximum 
value  of  p)  surface  /Xmax(ct|  Umax)  region  used  to  generate  Figures  3.6  and  3.7.  The 
closed-loop  tracking  error  bound  has  a  roughly  analogous  behavior  to  the  open- 
loop  residual  vibration  energy  surface  of  Figure  3.7.  Recall  that,  outside  the  region 
bounded  by  the  inequality  of  Eq.  (3.81),  we  have  guaranteed  Lyapunov  stability, 
using  the  control  law  of  EJq.  (3.98)  and  the  reference  rigid-body  torque  given  by 
Eqs.  (3.90)-(3.94).  From  Figure  3.7,  it  is  clear  that  sufficiently  small  pmnx  and 
large  a  result  in  arbitrarily  small  tracking  errors,  but  the  (small  a,  large  Umax) 
near-bang  reference  maneuvers  cannot  be  tracked  as  precisely.  It  is  easy  to  see 
how  a  subset  of  the  candidate  (a,  Umax)  designs  can  be  found  that  satisfy  specified 
inequalities  on  maneuver  times,  tracking  errors,  and  residual  vibration  energy  by 
direct  examination  of  the  surfaces  of  Figures  3.6-3.9. 

The  results  obtained  from  the  simulations  (and  in  the  actual  hardware  imple¬ 
mentations  discussed  later  and  in  [Junkins  1991,1993])  support  the  conclusion  that 
these  surfaces  can  be  used  to  establish  a  large  region  of  feasible  designs  for  near- 
minimum-time  controls  in  the  space  of  torque-shaped  parameters  and  control  gains. 
Optimization  over  the  set  of  feasible  designs  should,  in  general,  include  considera¬ 
tion  of  the  nature  of  expected  disturbances  to  be  rejected.  One  detailed  simulation 
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Figure  3.8.  Botmdaiy  of  the  Lyapunov-stable  trackiiig  regioa  vs  saturation  torq[ue  and  torque¬ 
shaped  parameter 


Figure  3,9.  Open^loop  40®  maneuver  with  random  disturbances 
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is  now  considered  to  show  state  and  control  variable  histories  along  a  typical  trajec¬ 
tory  of  the  family  of  trajectories  underlying  the  above  surfaces.  In  these  simulations 
the  effects  of  worst-case  disturbance  torques  are  included  in  order  to  illustrate  the 
effectiveness  of  controls  in  the  presence  of  unmodeled  effects.  For  simplicity/only 
the  case  of  40®  rest-to-rest  maneuver  is  considered  here,  along  with  setting  Uniax= 
400  oz-in.  for  all  cases. 

For  the  computational  studies,  two  control  laws  are  considered:  namely,  the 
output  feedback  law  (control  law  I)  of  Eq.  (3.88),  and  the  tracking-type  feedback 
control  law  (control  law  II)  of  Eq.  (3.98).  Although  control  law  II  could  be  used 
with  an  arbitrary  reference  trajectory,  the  torque-shaped  rigid  body  trajectories  of 
Eqs.  (3.90)-(3.94)  are  spedfically  selected  for  investigation.  The  torque-shaped 
open-loop  control  lustory  Uref  can  be  precomputed  (in  a  fraction  of  a  second!) 
from  Eqs.  (3.90)-^3.94)  and  stored,  whereas  the  instantaneous  trajectory  variables 
{ffr«fi®r«f>lLoSo(t)-Mo(t)]r«f}  are  integrated  easily  in  real  time.  Note  that  the 
boundary  conditions  of  Eqs,  (3.92)  are  enforced  by  using  Eq.  (3.94)  to  compute  the 
trajectory  maneuver  time  as  a  function  of  the  maneuver  angle,  saturation  torque, 
and  torque-shaped  parameter. 

We  now  discuss  the  simulation  results  using  control  law  II,  which  obviously 
blends  into  control  law  I  in  the  end  game  (for  t  >  tf ).  In  the  experimental  results  in 
the  subsequent  discussion,  maneuvers  carried  out  by  both  control  laws  are  reported. 
Both  open-loop  (all  gi  =  0)  and  closed-loop  time  histories  of  selected  state  variables 
are  shown  in  Figures  3.9  and  3.10. 

Figures  3.9(a)  and  (b)  show  the  hub  angle  and  angular  velocity  for  the  case  of  an 
open-loop  control  and  in  the  presence  of  substantial  impulsive  and  quasirandom  (5 
oz-in.,  lo-)  disturbance  torques.  It  is  evident  that  the  disturbance  torque  history  is 
very  significant  vis-a-vis  disturbing  flexible  dynamics  in  our  experimental  hardware; 
however,  certain  nonrandom,  nonlinear  effects  associated  with  the  bearing  friction 
catise  disturbances  that  are  highly  correlated  in  time  and  are  not  well  represented  by 
the  present  white-noise  model  of  the  disturbance  torques.  In  spite  of  the  substantial 
disturbance  torques  (Figure  3.9),  however,  it  is  evident  that  the  simulations  indicate 
that  the  closed-loop  flexible  body  dynamics,  in  fact,  follow  the  near-minimum-time 
rigid-body  motion  closely  while  effectively  suppressing  vibration,  as  shown  Figure 
3.10.  In  addition  to  the  variables  graphed  in  Figures  3.9  and  3.10,  we  confirmed 
that  the  energy  of  the  first  10  modes  was  effectively  suppressed.  These  simulated 
results  are  very  consistent  with  the  experimental  results  discussed  in  the  following 
section  and  those  presented  in  [Junkins  1991,1993]. 


Experimental  Results 

In  all  of  the  experiments  in  the  following  discussion,  the  target  final  angle  is  set 
to  40®  and  u^ax  =  400  oz-in.  A  det^led  description  of  the  hardware  is  given 
in  Appendix  I.  We  overview  the  system  as  follows:  the  configuration  (Figure  3.4, 
Table  3.2)  has  a  span  of  approximately  9  ft  and  has  six  natural  frequencies  below 
20  Hz.  The  system  is  accurately  balanced,  and  the  four  aluminum  appendages* 
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Tunc  [see] 

Figure  3.10.  Closed-loop  40®  maneuver  with  random  disturbances 


geometric,  mass,  and  stif&iess  parameters  are  matched  to  high  precision;  the  first 
three  measured  cantilevered  natural  frequencies  of  the  four  individual  beams  were 
found  to  be  identical  to  within  0.05  Hz. 

With  this  design,  the  appendages  vibrate  almost  exclusively  in  the  horizontal 
plane;  the  hub  is  balanced  on  a  custom-designed  needle-jewel  bearing  that  constrains 
the  hub  to  rotate  about  the  vertical  axis.  Our  measurements  confirm  that  negligible 
out-of-plane  motion  occurs  in  our  experiments,  although  there  is  occasional  evidence 
of  small  beam  torsional  vibrations.  Also,  to  very  high  accuracy,  we  can  state  that  om 
experimental  results  confirmed  that  only  the  antisymmetric  in-plane  modes  [implicit 
in  the  derivation  of  Eqs.  (3.83)]  were  excited  during  rest-to-rest  maneuvers  using 
the  hub  torque  actuator.  The  bearing  stiction/friction  torque  is  significant  20 
oz-in.),  but  is  sufficiently  small  and  predictable  to  permit  meaningful  experiments. 
Aerodynamic  damping  is  important  only  during  the  most  rapid  slew  maneuvers; 
in  most  cases,  it  represents  a  small  perturbation  as  compared  to  the  larger  active 
vibration  damping  introduced  by  the  feedback  controller.  The  control  torque  is 
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achieved  by  of  a  reaction  wheel  mounted  to  the  shaft  of  a  DC  motor  [Figure 

3.4(c)],  which  is,  in  turn,  mounted  to  the  hub.  The  commanded  motor  torque 
is  achieved  by  precision  current  control  using  power  amplifiers,  as  described  in 
Appendix  I  of  [Junkins  1993].  The  angular  rotation  of  the  hub  is  measured  using  a 
Teledyne-Gurley  angle  encoder,  accurate  to  about  0.01®,  whereas  the  toot  bending 
moment  and  shear  force  estimates  are  derived  from  conventional  full-bridge  strain- 
gauge  measurements.  The  derived  estimates  of  the  angular  velocity  history  have 
a  variance  of  approximately  l®/s  and  a  time  lag  of  0.01  s.  The  noise  and  phase 
lag  in  the  angular  velodty  estimates  and  the  strmn-gauge-derived  root  shear  force 
and  bending  moment  estimates  limit  the  bandwidth  of  the  closed-loop  qrstem  to 
the  range  from  approximately  0  to  10  Hz.  The  errors  (noise  and  phase  lag)  in  the 
derived  hub  angular  velocity  estimates  represent  the  mdn  source  of  the  predsion 
and  bandmdth  constraints  of  the  experimental  implementations.  The  control  loops 
were  dosed,  for  all  experiments  discussed  later,  at  75  Hz;  the  angle  encoder  was  also 
sampled  at  75  Hz,  whereas  the  strain  gauges  were  sampled  an  order  of  magnitude 
faster,  and  filtered  to  reduce  the  effects  of  sensor  noise  and  higher-frequency  modes 
outside  the  bandwidth  of  our  controller. 

Figure  3.11  shows  the  experimental  system  response  for  a  maneuver  using  control 
law  I  [the  constant  gain  control  law  of  Eq.  (3.88)]  with  gi  =  600  oz-in./rad,  g2  = 
800  oz-in./rad/s,  and  ga  =  0.  Even  though  control  law  I  [Eq.  (3.88)]  is  anticipated 
to  be  poorly  suited  for  large-angle  maneuvers,  we  nonetheless  apply  this  law  to 
carry  out  40®  maneuvers  to  provide  a  reference  for  the  subsequent  discussion.  Since 
the  initial  position  error  is  large,  the  maneuvers  start  from  zero  with  a  large  initial 
discontinuity  to  a  large  torque.  For  this  gain  selection,  we  see  a  large  hub  angle 
overshoot  (^10®)  and  significant  structural  vibration  that  was  effectively  suppressed 
by  around  12  s;  the  control  was  terminated  at  16  s.  These  results  were  repeatable; 
however,  the  residual  angle  was  typically  ^  0.25®  because  the  constant  gain  gi 
could  not  be  set  sufficiently  large  to  overcome  terminal  bearing  stiction  without 
causing  initial  actuator  saturation  and  large  overshoots,  and  a  compromise  value 
was  adopted  for  the  sake  of  illustration.  As  is  demonstrated  in  Ref.  5,  the  overall 
maneuver  shape  and  settling  time  is  sensitive  to  the  gains  selected;  however,  less 
than  10%  reductions  in  the  12  s  settling  time  can  be  achieved  without  initially 
saturating  the  actuator. 

Control  law  II,  on  the  other  hand,  leads -to  very  attractive  near-minimum-time 
maneuvers.  One  feasible  set  of  gain  settings  and  torque  shaped  parameters  leads 
to  the  experimental  results  shown  in  Figure  3.12.  The  effect  of  using  a  smooth, 
judiciously  shaped  reference  torque  history  is  evident  if  one  compares  the  output 
and  control  variable  histories  in  Figure  3.12  with  those  of  Figure  3.11.  This 
implementation  of  control  law  II  produced  much  smaller  overshoot  («  1.5®  vs  •v  10®) 
and  shorter  maneuver  time  (6  s  vs  12  s),  and  greatly  reduced  the  severity  of  peak 
vibration,  compared  to  control  law  I.  These  results,  especially  when  considered 
in  conjunction  with  numerous  other  cases,  are  reported  in  [Junkins  1990]  and 
[Thompson  1989],  provide  convincing  evidence  that  control  law  II  is  a  versatile 
and  highly  effective  way  to  incorporate  open-loop  torque-shaped  optimization  with 
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Figure  3.11.  Bxpezimental  results:  a  40^  maneuver  using  control  law  I  =  600,  $2  =  SOO, 
9Z  =  0.0). 


en  route  and  terminal  vibration  suppression.  The  fact  that  a  globally  continuous 
control  structure  is  implicit  in  this  approach  leads  to  minimal  difficulties  in  realizing 
robust  control  laws. 

We  encountered  several  practical  difficulties  in  our  experimental  work,  but 
these  difficulties  are  not  central  to  our  control-law  design  approach.  First,  the 
root  shear  force  and  bending  moment  approximations  obtained  using  str^n-gauge 
measurements  resulted  in  sufficiently  noisy  and  nonlinear  measurements  that,  using 
this  feedback  (ga  ^  0),  only  marginally  improved  the  controlled  response  over,  for 
example,  the  results  in  Figure  3.11.  These  anomalies  resulted,  we  hypothesize,  from 
the  nonideal  beam-clamp  effects  near  the  station  where  the  strain  measurements 
were  being  made.  Any  slight  play  in  the  clamp  due  to  large  root  moment  variations 
would  manifest  itself  in  spurious  strain  measurements.  Also,  deriving  the  angular 
velocity  estimate  from  the  noisy  angle-encoder  readout  was  difficult  to  accomplish 
with  high  precision  and,  as  a  consequence,  we  constructed  a  digital  filter  to  process 
the  angle  encoder  data  and  roll  off  the  frequency  content  in  the  rate  estimates 
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Figure  3*12«  Experimental  results;  a  40®  maneuver  using  control  law  II  (pj  =  3000,  g2  —  800, 
g2  ss  0.0,  Of  —  0.2,  UnxAz  ss  400}* 

above  10  Hz.  We  found  this  was  useful  to  avoid  erroneous,  phase-lagged  high- 
frequency  components  of  the  feedback  that  disturbed  the  higher-frequency  modes. 
These  problems  can  be  essentially  eliminated,  of  course,  by  investing  in  a  more 
precise  sensor  to  measure  angular  displacement  and/or  angular  velocity,  as  well  as 
a  load  cell  to  measure  the  root  shear  and  bending  moments.  Finally,  our  bearing 
presented  us  with  another  set  of  practical  difficulties.  Based  on  analysis  of  our 
bearing  hardware,  it  became  evident  that  interaction  of  the  structure  with  the 
bearing  accounts  for  the  overwhelming  source  of  disturbance  torques.  The  bearing 
Mction/stiction  model  developed  from  our  analysis  [J unkins  1990]  has  the  form 

twing  =  -cisign(0)  -  cjff  +  HOT  (3.102) 

where  we  find  ci  ^  20  oz-in.  and  cj  0.001  oz-in./rad/s. 

Thus,  the  first  (stiction)  term  of  Eq.  (3.102)  dominates  the  bearing  torque 
for  moderate  9  and  is  about  5%  of  the  peak  commanded  torque  of  400  oz-in. . 
Although  we  believe  that  Eq.  (3.102)  models  the  bearing  friction  well,  we  found 
that  it  is  difficult  to  use  this  model  to  compensate  for  bearing  friction  in  real  time 
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becatise  angle-encoder  noise  results  in  uncertainty  in  the  estimated  instants  that  6 
switches  sign.  This  difficulty  has  significant  practical  consequences.  If  we  modify 
our  control  to  compensate  for  bearing-disturbance  torques  (essentially,  attempt  to 
cancel  it)  using  Eq.  (3.102),  the  commanded  discontinuity  (at  the  estimated  time 
that  h  changes  sign)  will  not  coincide  exactly  with  the  actual  stiction  discontinuity; 
even  slightly  mistimed  compensation  torque  discontinuities  can  actually  worsen  the 
disturbance!  Although  we  experimented  with  several  bearing-torque  compensation 
schemes,  we  ultimately  decided  simply  to  consider  bearing  torque  an  anticipated 
and  well  modeled  disturbance.  Our  simulations  (such  as  the  restilts  shown  in  Figure 
3.10)  indicated  that  our  control  approach  could  easily  tolerate  disturbances  of  this 
magnitude,  and  our  successful  experiments  in  Figures  3.11  and  3.12  and  [Junkins 
1990]  certainly  confirm  that  our  implemented  control  laws  are  robust  in  the  presence 
of  the  actual  disturbances  &om  all  sources. 

This  case  study  provides  a  good  illustration  of  the  mix  of  theoretical  analysis, 
numerical  computation,  and  engineering  judgment  required  to  carry  out  successful 
applications.  The  ultimate  objective,  of  course,  is  to  obtain  perfect  closure  between 
theory  and  experiment.  However,  it  is  not  realistic  to ‘expect  the  high  degree  of 
closure  obtained  above,  when  faced  with  more  complicated  dynamical  systems.  Note 
that  excellent  results  were  obtained,  in  spite  of  modest  investments  in  sensors  and 
actuators;  however,  for  systems  requiring  high  precision  and  wide  control  bandwith, 
it  would  be  necessary  to  have  corresponding  improvements  in  the  precision  and 
bandwith  of  the  sensors.  In  the  context  of  the  above  numerical  and  experimental 
results,  however,  we  observe  that  a  large  degree  of  model-error  robustness  implicit 
m  our  approach  stems  from  our  theoretical  verification  that  the  control  of  Eq.  (3.88) 
remains  stabilizing  for  most  of  the  usual  variations  in  modeling  assumptions,  and  we 
used  judicious  sensor  filtering  to  roll  off  the  effects  of  the  system  dynamics  outside 
the  sensors’  bandwidth.  In  conclusion,  the  excellent  agreement  between  theory  and 
experiment  evident  in  Figures  3.10  and  3.12  represents  prototypical  (rather  than 
usual)  results. 

3.7  CONCLUDING  REMARKS 

In  this  chapter,  we  have  summarized  the  central  aspects  of  Lyapunov  stability 
theory  with  particular  emphasis  upon  the  role  that  it  can  play  in  designing  stable 
controllers  for  nonlinear  multibody  systems.  Several  elementary  analytical  and 
numerical  examples  are  provided  to  illustrate  the  ideas  and  to  provide  some  basis 
for  extrapolating  the  practical  implications  of  the  methods  presented.  A  more 
extensive  example  is  offered  to  introduce  some  ideas  on  cooperative  control,  in 
which  two  or  more  manipulators  are  manipulating  a  payload  while  cooperating  with 
each  other  to  minimize  a  measure  of  the  associated  control  and  constraint  forces 
and  moments.  The  chapter  concludes  with  an  example  wherein  maneuvers  are 
designed  for  a  multibody  flexible  structure  and  good  dosure  is  obtained  between 
the  analytical,  computational,  and  hardware  experimental  results.  These  results 
support  the  theoretical  and  practical  value  of  these  developments. 
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=>  Prototype  problems  from  A/C  flight  mechanics,  sensing/actuation, 
aeroelasticlty,  &  associated  dynamics/controPstability  issues. 


OUTLINE 


Motivation/Relevance/Approach 

Key  Results/Examples 

Status  and  "Where  to  From  Here? 


Approach  to  Nonlinear  Structural  Response 
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Answer!  Yes,  we  h3.ve  found  un  elegunt/geneml  solution  to  this 
fundamental  problem  =»  Improves  the  basic  accuracy/ 
speed/numerical  stability  tradeoffs  by  over  an  order  of 
magnitude. 


An  Orthogonal  Quasi-Coordinate  Formulation  of 
Dynamical  Models  for  Nonlinear  Structural  Systems 


Classical  Approach 


angular  accelerations  [rad/s]  configuration  angles 


A  Low-Dimensioned  Example 


Nonlinear  Mechanism 


Unconstrained  Free  Response 
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Constrained  Free  Response 


Instantaneous  Mass  Matrix  Eigenvalues 


Method  V. 
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Method!: 


Some  Simulations 

state  vector  is  {v,  x},  using  orthogonal  quasi  -  coordinate  approach 

statevectoris{x,x},  usingM-^x)  =  to  solve  for  x 

statevectoris{x,x},  using LDV decomp.  ofM(x)  tosolveforx 
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A  Stable  Transition  Between  Flight  Modes: 

Accomplished  via  a  Novel  Compromise  Between 
Fixed  and  Rotary  Wing  Configurations 
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Wing  stabilization  is  passive  and  inherent  in  the  design. 

Gust  response  is  greatly  reduced  (~  one  order  of  magnitude). 
Unusual  Near-VTOL,  Loiter,  and  Endurance  capabilities  ... 


Auto-Stable  Free  Wing:  Essential  Idea 

Consider  a  Free  Wing  in  trim: 

=>  The  Free  Wing  can  be  thought  of  as  a  'variable  geometry  lifting  weathervane' 
=>  Due  to  the  free  pivot,  for  trimmed  flight,  both  the  wing  and  the  aircraft  must  be 
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of  the  thrust  T  =>  gain  access  to  an  infinite  set  of  trimmed  flight  modes  for  the  AJC. 

Satisfying  trim  conditions  obviously  does  not  guarantee  stable  flight  of  the  aircraft. 

=>  The  aerodynamics  are  coupled  to  the  tilt  angle  and  thrast,  and  dynamic  stability 
analysis  of  variable  geometry  aircraft  is  inherently  non-linear  and  non-trivial! 

=>  Research  Issues-  flnid/structure  interaction,  stability/cor-f'-'^l  aeroelastic  effects 


Acceleration 


Nonlinear  Gust  Response  for  The  Freewing 
Scorpion  Vehicle  in  Loiter  Mode 

U^=52ft/s  {~36mph))  k 


Vertical  Harmonic  Gust:  Y 

A  siuOdt  Landing/Takoff/Loiter  Mode 


Typical  Nonlinear  IIAccelerationll  Responses  vs  time 


A  New  Approach  to  Nonlinear  Structural  Response;  Summary 
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Problem:  Dynamic  response  simulations  for  DPS,  especially  for 


c/D 


o 

4-> 

<6 

q 

o 

> 

•  rH  .  _ ; 

T  "H 

<o 

o 

T3 

bX)  M  *3 
2  O  a 

a 

© 

•  rH 

13 

.tn 

.  4  ^ 

<2 

&  Oh 

Cf-( 

•  'tH 


O  ^  ^ 

Is  'S  ^ 

cd  O 

OJ  jD 

^2  g 

§11 

ri  B  H 


^  § 
(/)  Q 

M  g 


o  w).52 

S  a 

£: 

c/i  O 
(D  Co 


«/) 

c/i 

•  fH 

(D 

H 


(D 

’T^ 

O 

S 


a 

o 

i 

N 


q 


<D 

O 


c3  oq 


q 

©  'xs 


g 

o 

*'  ’T^ 

I 


5^ 


q 

4-> 

"q 

4-4 

CO 

q 

(D 

T— < 

■  \ 

13 

© 

Co 

cd 

B 

<D 

q 

2 


v-i 

<2 


O 


Co 


q  *;r*  ^ 

(D  O 


o 

•  rH 


o 

r-H 

q 

Vh 

o 


©  S  Q  '+H 

©  ^  Co 

'Ki 

Si 


CfH  ^ 
(D  Jj 
JH  Co 

O  4^ 

•pS  Si 

q 

3  S 


q 

> 

^-( 

o 


k. 


2  ^ 
©  '>3 

00 

CO  ^ 


© 

H 


CO 

© 

O 

■§ 

A 

A 

il 


§ 

q  ^ 

o  ^ 

ip  ^ 

o  ^ 
5q 


»  o 
q  ^ 
1  g 


i 

(/2 


;s 

s 

>l>i>a 

00 


Co 

Si 

O 


4<k 


rs 

4-4 

Si 


Si 

Co 

s 

O 

Si 

g 


o 

Co 

4-4 

O 

I 

(D 

Vh 

(D 

q 

o 

bX) 


(D 

q 

*P 

q 

o 

u 

<D 

§ 

u 

•  • 

q 

o 

CO 

© 

q 

a 


329 


is  a  near  neighbor  of  the  problem  we  seek  to  solve? 
Answer:  A  qualified  yes  =»  Attractive  solution  validation  tool. 


Flow  Chart  for  Construction  of  Exactly  Solved 
Benchmark  Problems  Near  an  Approximate  Solution 


GIVEN  A  DYNAMICAL  SYSTEM 
x(t)  -f{t,x,x,p), 
where  p  is  the  model  parameter  vector 
x(to)  =  Xo,  x(to)  =Xo,  to^t<  tf 


GIVEN  A  NUMERICAL  SOLUTION  PROCESS 
{xi,X2,"-,Xn},  where  xj  =x(ti) 


ORTHOGONAL  CHEBYSHEV  APPROXIMATION 

Xb  (.0  =  smooth  interpolation  of 

{xi,X2,—,Xn} 


INVERSE  DYNAMICS 
e(t)  =  Xbit) -f(t,Xh{t),Xb{t),p) 


BENCHMARK  PROBLEM 

The  known  interpolated  solution :  (  ^ ) 

exactly  satisfies  the  differential  eqns 
^(0  =f(t,x,x,p)+e(t), 
with  the  boundary  conditions: 
x{to)=Xl,{to),  x{to)=Xb{to),  to<t<tf 


Example  ODE  System 

x(t)  =f{t,x,x,p)  +  e(t)  =  +  +  sin{t)  +  eit) 


ei 


331 


A  Three  Body  Distributed  Parameter 


Hub  Inertia  I  hub 
Hub  Radius  r  S 


Example  ODE/PPE  Hybrid  System 


Benchmark  System  Model:  Given  interpolated  {y(f,  Jc), 

{5f(t,x),  5k(0,  5/fi>(0.  Sm^p  }  to  exactly  satisfy  the  hybrid  system  of  odes/pdes: 

j8+ j‘'p{x+r)[y  +  (x+r)Q]dx+m(L+r)[(L+r)6+y]-J^{fM  +  5fi.tMi’‘  +  r)dx 
+j[8  +  y']  +  (L  +  r){f,;p  +  5/,ip}  +  {K,ip  +  6«,;p}  +  {K  +  8«}  =0  =>  slep4:  6»(t) 

p[y+(x+r)6]+£//"'-{«t,x)  +  5/(t.x)}=0  =>  stepl:  hf(t,x) 

with  the  boundary  conditions'. 

EIf\t,L)-m[(L-^r)^  +yit,L}]  + {f,ipit)  +  5f,ip{t)]  =0  step2:  Sf„p(l) 

£//'(f,L) + J[S  +y'(f,L)]  -  {Ktfp(t)  +  8k,(p(0}  =0  =«>  ^‘ep3:  Su,ip(0 


Convergence  Study  on  a  Distributed  Parameter  System 
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Where  to  from  here? 

Study  more  applications. 


